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Viral complacency 


The first outbreak of yellow fever in Angola in almost 30 years illustrates the danger of a short 


attention span when confronting epidemic threats. 


that the Ebola epidemic in West Africa no longer represents 

an international public-health emergency. But as experts also 
warned last week, there must be no let-up in improving readiness for 
the next Ebola outbreak — including the nightmare prospect of an 
epidemic in the megacities of Africa. 

An expert panel of researchers convened by the London-based 
Wellcome Trust and the Center for Infectious Disease Research and 
Policy at the University of Minnesota, Minneapolis, warned in particu- 
lar that support for Ebola vaccine research must not be allowed to slip. 
Although a successful experimental Ebola vaccine has been developed, 
much work remains to be done if safe and effective vaccines are to be 
ready in sufficient amounts to quickly nip future outbreaks in the bud. 

“As Ebola infection rates come under control it’s a huge concern that 
complacency sets in, attention moves to more immediate threats, and 
Ebola vaccine development is left half-finished,” cautioned Jeremy 
Farrar, director of the Wellcome. Too often in the past, the world has 
stumbled from epidemic to epidemic, failing to learn the lessons of the 
last. Emergency responses to the latest threat capture headlines, research 
and political attention. But too often this attention quickly fades. 

Yellow fever, a virus that kills many of those it infects, is just one 
example of the failure to sustain control efforts. The virus is spread by 
Aedes aegypti, a mosquito adapted to live in urban areas. An ongoing 
yellow-fever outbreak in Angola, the first for nearly 30 years, began in 
December and has since spread within the country, infecting at least 
490 people and killing 198. It now threatens the wider region. 

Yet mass vaccination and intensive mosquito-control programmes 
largely eliminated this vaccine-preventable mosquito-borne disease by 
the 1940s. In South America, where the disease was widespread, the 
mosquito vector was virtually wiped off the map by the 1970s. But the 
success of mosquito control led to complacency and scale-back. 

As a result, A. aegypti is now present across more of the continent 
than before control began. It is similarly resurgent in tropical and sub- 
tropical regions worldwide, resulting in sporadic outbreaks of yellow 
fever in at-risk countries. Moreover, the mosquito’ comeback has 
fuelled large urban outbreaks of dengue, chikungunya and now Zika 
viruses, with at least tens of millions of people infected. 

There’s a long list of other Aedes-borne viruses that are currently 
restricted to animal reservoirs in the wild. But some of these, including 
potentially deadly ones, will inevitably establish themselves in cities with 
Aedes mosquitoes. Given rampant urbanization throughout the tropics 
and subtropics, dense human and Aedes populations are ticking time 
bombs. The failure to sustain Aedes control illustrates the need for long- 
term persistence to curb epidemic threats. 

The Angola outbreak has already depleted an international emer- 
gency stockpile of 6 million doses of vaccine, leaving authorities 
scrambling to obtain extra vaccine from national immunization pro- 
grammes. The WHO and other international agencies launched the 


To World Health Organization (WHO) last week declared 


Yellow Fever Initiative in 2006 to reboot yellow-fever mass-vaccina- 
tion programmes and routine vaccination in the highest-risk African 
countries. But vaccine stocks are still insufficient, and vaccine coverage 
in many African countries too low, leaving many vulnerable. The prob- 
lem, says one official associated with the initiative, is that yellow fever 
is a “forgotten disease’, which makes it difficult to attract sustained 
political interest and funding. 

Following the Ebola epidemic in West 


“Too of tenin the Africa, a slew of commissions and reports 
past, the world laid out a broad consensus on what needs to 
has stumbled be done for a more proactive and sustained 
from epidemic preparedness against epidemic threats. 
to epidemic, These measures include reinforcing public- 
failing to learn health systems, surveillance and diagnostic 
thelessons ofthe capacities, and training health workers to 
last.” identify and respond early to disease out- 


breaks. Weaknesses in these areas have been 
identified as factors that allowed what was a small Ebola outbreak to 
spiral out of control. 

Monitoring viruses in the wild, and a better understanding of how 
factors such as deforestation, and the hunting and consumption of bush- 
meat, influence spillover of animal viruses into humans, is key. So, too, 
is the pre-emptive development of drugs and vaccines against known 
potential epidemic threats. 

The Ebola epidemic has prompted vigorous discussion on all these 
points, on what shape or form any new global initiatives should take, 
and where the required multibillion-dollar investment will come 
from. The big risk is that as the Ebola epidemic fades from memory, 
the sustained political commitment and funding required will not 
materialize, and business as usual will resume. That must not be 
allowed to happen. = 


Safety in neutrons 


To boost nuclear security, research reactors 
must eliminate highly enriched uranium. 


orking with the United States, Japan has removed all of the 
Wiss enriched uranium (HEU) and the separated pluto- 
nium from one of its nuclear reactor facilities, to minimize 
the risk of theft and use by terrorists. The two countries have now 
pledged to convert a second research reactor to use safer, low-enriched 


uranium. These are among the latest in a series of accomplishments that 
have stemmed from US President Barack Obama’s biennial Nuclear 
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Security Summit, which wrapped up on 1 April. More than 50 coun- 
tries attended, most represented by heads of state, making a variety of 
commitments to reduce the risk of nuclear terrorism. 

These projects are also a reminder of just how slow progress has 
been — and how much remains to be done. 

Obama launched the agenda in a 2009 speech in Prague, calling on 
governments to secure or eliminate all vulnerable nuclear materials in 
four years. His speech underscored the fact that the threat of a nuclear 
attack has increased even as the danger of apocalyptic nuclear warfare 
has receded. Recent revelations that the Islamist terrorist group ISIS may 
have been targeting a nuclear facility in Belgium make this all too clear. 

The initial focus has been on HEU, because of simple physics. 
Whereas plutonium must be compressed with explosives to produce 
a nuclear explosion — a feat that is probably beyond the technical 
capability of terrorist groups — the process is simpler for weapons- 
grade HEU, which is also used in many reactors. The United States 
and Russia, which have supplied the world with the bulk of HEU, 
have stepped up efforts to secure, remove or blend these materials 
into low-enriched uranium (LEU), which has 20% or less of the key 
isotope uranium-235. Security has been upgraded at 32 facilities, and 
12 countries have been declared HEU-free since 2010. 

Many of these materials are located at civilian research reactors. The 
risks were recognized long ago; in 1978, for example, the United States 
began eliminating HEU fuel in these reactors. In 1992, the US Congress 
enacted alaw requiring countries that receive its HEU to commit to con- 
verting reactors to LEU fuel. To maintain reactor performance, however, 
scientists needed to develop a new generation of high-density LEU fuels, 
which are now available for most research facilities. 

This is good news, but challenges remain. Existing high-density LEU 
fuels cannot be used without degrading performance in 11 specialized 


US and European research reactors. Certifying new fuels and converting 
these reactors could take nearly two decades. In January, the US National 
Academies of Science, Engineering, and Medicine recommended that 
specialized US reactors adopt an interim solution and convert to less- 
enriched fuel sources containing 30-45% uranium-235. This could — 
and should — be accomplished over several years, without impeding 
efforts to complete the shift to safer LEU fuel as soon as possible. 
Researchers also need a comprehensive 


“Research strategy to maintain research reactors. The 
reactors are European Commission is sponsoring a research 
just one part consortium called HERACLES to do just that. 


The White House Office of Science and Tech- 
nology Policy should convene agencies and 
research facilities to develop a path forward, and engage internation- 
ally. Many of these specialized research reactors are getting old; in some 
cases, given delays with new LEU fuels, it may make sense to start anew. 

But research reactors are just one part of the puzzle, and the question 
now is how to carry the broader nonproliferation agenda forward once 
Obama leaves office. His four nuclear summits have boosted political 
attention and accelerated progress, but the world is awash with nuclear 
materials. Nuclear safety and security falls to a problematic patch- 
work of international institutions, including the International Atomic 
Energy Agency (IAEA), Interpol and the United Nations, and the latest 
summit produced a variety of initiatives to bolster these institutions. 

That is a start. Ultimately, the world needs a new convention that 
sets specific standards for nuclear security and allows inspections and 
enforcement by the IAEA. In the meantime, governments must work 
through existing institutions to share and implement best practices. 
Regardless of cost, research facilities must ensure that their nuclear 
materials are safe and secure. m 


of the puzzle.” 


Mind matters 


Mentalillness is moving up the global 
agenda — but there is still much to do. 


13 and 14 April, the World Health Organization (WHO) and 

the World Bank will hold an unprecedented joint conference 
in Washington DC to discuss mental health as both a global disease 
and an economic problem. 

It is a welcome gesture after many snubs. Mental illness was left 
out of the United Nations’ influential high-level meeting on non- 
communicable diseases (NCDs) in 2011. Almost begrudgingly, the 
UN gave mental health a brief mention in the entry for NCDs in its 
2015 Sustainable Development Goals. As well as reducing deaths from 
NCDs such as heart disease by one-third, it said, the world should also 
“promote mental health and well-being”. 

The UN failed to recognize that in terms of impact, mental disorders 
are at least as harmful as those better-recognized and better-funded 
diseases. Mental-health conditions account for 37% of the healthy life 
years lost to NCDs. This reflects shortened lifespans and a loss of ability 
to work effectively, and it translated to a global loss of US$2.5 trillion in 
2010; that figure is projected to rise to $6 trillion by 2030 (see go.nature. 
com/dfdkbh). Simply ‘promoting’ mental health is not enough. 

Many developed countries scarcely give mental illness more than 
lip service. In the United States, for instance, the topic is brought up 
only when a mass shooting launches a ‘conversation’ about the poor 
state of mental-health treatment — and then only as a distraction from 
the gun-control debate. Such attention rarely leads to new funding for 
mental-health research. 

Underlying much of this neglect is the persistent bias, conscious or 


Nes! 2016 should be a good year for mental health. On 
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not, towards believing that many mental-health conditions are a moral 
failing rather than the result of complex biology, despite overwhelming 
evidence for the latter. For poor countries wracked by infectious and 
childhood diseases, it can be easier to confine people with psychosis 
than to try to treat them. And treatment is hard to find the world over: 
globally, there are only nine mental-health providers for every 100,000 
people, and some countries have only one or two providers. A Com- 
ment on page 25 calls for a global strategy to combat care deficiencies 
in both developing and developed countries. 

Even in developed countries, people find it difficult to consider 
depression as a condition to be combated with the same clarity of pur- 
pose as heart disease. This is exacerbated by the failure to develop drugs 
for depression that are as clear-cut and effective as statins. This is unlikely 
to change soon: if anything, neuroscience is painting a bleaker picture by 
showing how complex these diseases are. More than 100 genetic regions 
have been associated with schizophrenia — and autism and depression 
are probably even more complex. Addressing all of these will require 
entirely new approaches. A News Feature on page 20 looks at the latest 
developments on one emerging, but still unproven, front: phone apps 
intended to assist people with mental-health disorders. 

The annual Mental Illness Awareness Week in October promotes 
mental health, as do many organizations devoted to erasing the stigma 
and bias that harm patients and inhibit politicians. But so much more 
is needed in terms of improved treatments and access to care. 

Finance ministers at the meeting next week should recognize the 
positive economic returns of investing in this direction. The UN, the 
WHO, the World Bank and governments should be expected to con- 
tribute something tangible to this pressing issue. The summit’s attend- 
ees should come away with plans for creating specific development 
targets, and mechanisms for funding research, 
ensuring that treatment is available in low- 
income countries, and holding nations respon- 
sible for giving mental illnesses as prominent a 
place in health care as other NCDs. = 
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into its thinking and operations — and statements by everyone 

from Nature and the UK government to Paul Nurse, former 
president of the Royal Society, indicate that it is — then we social 
scientists must do more to make this happen. 

Our input is necessary because, too often, the reach and influence of 
research is discovered only with hindsight. Lessons are ‘learned’ only 
after the social implications of new domains of science and technol- 
ogy have provoked controversy or challenged existing norms. Social 
science can help to predict these implications and plan for them. It can 
also help to frame science questions to make them more sensitive to 
economic, cultural and social factors. 

Scientists have their own hands-on understanding of how the science 
and innovation system operates, and what demands it faces. Why should 
they pay attention to social-science contributions, particularly when 
these are not yet well known and understood 
across the scientific community? 

Social scientists understand that many col- 
leagues in the hard sciences are sceptical of what 
we can offer. We know that we need to make our 
contribution more widely understood and appre- 
ciated. This week, social scientists in Britain take 
what we hope will bea significant step. 

We (including myself and colleagues Robin 
Williams of the University of Edinburgh and Fred 
Steward of the Policy Studies Institute in London) 
are launching a professional association, with an 
explicit goal of engaging with science and science 
policy. Researchers in the social sciences have 
built links with specific scientific constituencies, 
but have failed to engage at a more general level. 

We argue that it is especially important to do this now, because all 
researchers are being urged to explore and make explicit how their 
work has reach and influence in the wider world. And we want to make 
clear that social science — especially science, technology and innova- 
tion studies (STIS) — should be integral to science and does not merely 
handle external issues, for example by addressing ‘public acceptance’ 

Our new body is called the Association for Studies in Innovation 
Science and Technology- UK (AsSIST- UK; assist-uk.com). It has been 
built over 18 months and has a strongly interdisciplinary membership 
of more than 200 people. Unlike similar bodies in mainland Europe and 
the United States, which tend to look inwards to the academic discipline, 
it is intended to take our methods and thinking into the broader world. 

One priority is to lobby for social-scientist involvement in the earli- 
est stages of research projects, when emerging 


I: the science community is serious about integrating social science 


ideas are most open to discussion. We want to DNATURE.COM 
work at national and regional levels, fromthe _ Discuss this article 
UK government and research-funding councils _ online at: 


to professional science bodies and the devolved —_go.niature.com/arstjn 


TO MAKE 
THE MOST OF 


SCIENCE, 
WE MUST 
KNOW HOW 
SCIENCE 


OPERATES. 


Recognize the value 
of social science 


A professional body for UK social scientists can help to improve research 
practice — and not just in public engagement, says Andrew Webster. 


governments in the four UK nations, which are experimenting with 
science and technology policies. 

Science and society are not discrete, as some researchers seem 
to assume. Knowledge — about the impacts of climate change, for 
example — gets its value and usefulness only when rooted in particular 
contexts. This makes it diverse and contested. From the perspective ofa 
social scientist, the challenge for researchers in genomics, for instance, 
is not to ‘communicate’ their ideas and discoveries to a homogenous 
‘public’: it is to realize that they are members of that public and act 
accordingly. Change in the direction of science cannot be viewed 
from a single position. It is relative and depends on the position of 
the observer, and on the interplay of diverse knowledge communities. 

Social science can help in interrogating the evidence and assump- 
tions behind theoretical models (say, for biomarkers of ageing). It can 
assess how technical standards are defined and applied to a new field, 
and how innovation shapes the way in which 
clinical trials are designed and conducted (as my 
own research has done in regenerative medicine). 
Models, standards and trials all rely on agreement 
about appropriate evidence and how it is used. 
This allows for fruitful discussion across the sci- 
ences and the early recognition that knowledge 
is provisional and may change — important if a 
project is to have scope for future debate. 

One of the first tasks for our new group is to 
review the research and activities of our mem- 
bers, to identify their existing impacts on science 
(through specific projects and membership of 
national bodies) and how they have engaged 
more broadly with the wider scientific com- 
munity. To increase that engagement, we aim 
to identify and share examples of good practice. We want to bring 
together STIS expertise in diverse fields such as biomedicine, energy 
and data analytics to inform research-council programmes — includ- 
ing the current move towards interdisciplinary doctoral training 
across the social and natural sciences. 

And we want to act as a national body that can contribute to specific 
fields of science from their earliest days onwards. Social scientists 
should help to plan for the possibilities of gene editing, for example, 
and anticipate the challenges posed by the diversification and growth 
of biobanks. 

To make the most of science, we must know how science operates, 
and understand the factors that influence it. Social scientists in the 
United Kingdom and elsewhere have been studying that for more than 
50 years. We are ready and able to help. m 


Andrew Webster is Director of the Science and Technology Studies 
Unit (SATSU) at the University of York, UK. 
e-mail: andrew. webster@york.ac.uk 
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Selections from the 
scientific literature 


RESEARCH HIGHLIGHTS 


| MEDICINE, 
Drug manufacture 
on demand 


A fridge-sized machine can 
produce multiple pharmacy- 
quality medicines on demand 
in hours, and could make drug 
supply chains more resistant to 
disruption. 

Klavs Jensen at the 
Massachusetts Institute of 
Technology in Cambridge 
and his colleagues developed 
a machine that can synthesize 
hundreds to thousands of 
doses of four common drugs: 
diazepam, diphenhydramine, 
fluoxetine and lidocaine. 
Their ‘plug-and-play’ system 
can conduct multiple-step 
synthesis from commercial 
starting materials, and can 
also purify and crystallize the 
resulting compounds into a 
useable product. 

Such a machine 
could one day allow 
on-demand production 
of medicines for hospitals, 
humanitarian operations and 
pharmaceutical research. 
Science 352, 61-67 (2016) 


Virus adaptation 
to mosquitoes 


Insect-borne viruses can lose 
some of their vigour during 
their time in mosquitoes 
before transmission to a 
vertebrate host. 

Gregory Ebel at Colorado 
State University in Fort 
Collins and his colleagues 
infected four mosquito species 
(including Aedes aegypti, 


ECOLOGY 


A path to better fisheries 


Better-managed fisheries could yield billions of 
extra dollars in profit and extract millions more 
tonnes of fish from the oceans. 

Many fish stocks are overexploited and poorly 
managed. Christopher Costello at the University 
of California, Santa Barbara, and his colleagues 
modelled the trajectories of 4,713 fisheries 
under various management systems. 

Compared to a ‘business-as-usual trajectory, 
improved management centred on fishing rights 
that boost the economic value of fish could 


pictured) with West Nile 
virus, which jumps back and 
forth between mosquitoes and 
birds. 

Fourteen days later, they 
isolated the virus from the 
insects’ saliva and organs 
and sequenced its RNA. 
They found that viruses from 
different mosquito species 
had acquired different sets 
of mutations. When they put 
the virus into avian cells, they 
found that these mutations 
made it more difficult for the 
virus to survive, suggesting 
that it had adapted to living 
in the mosquito. The findings 
may help to understand how 
insect-borne viruses are 
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increase fish numbers, the researchers estimate, 
and yield 16 million tonnes more fish by 2050. It 


would generate US$53 billion in profit and there 


spread, the researchers say. 
Cell Host Microbe http://doi.org/ 
bdxr (2016) 


White dwarf’s 
weird atmosphere 


Astronomers have discovered 
a white dwarf star that has 
an atmosphere made almost 
entirely of oxygen — the first 
of its kind to be discovered. 
White dwarfs are the dense 
cores that get left behind 
when a relatively small star’s 
nuclear fuel runs out and it 
sheds some of its outer layers. 
Usually their atmospheres 
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would be 619 million tonnes more biomass in 
the oceans as a result. The average overfished 
stock would recover in 10 years, and 98% 

of stocks would recover by 2050. Better 
management could improve food security, 
conservation and profits in nations that currently 
manage their fisheries poorly, say the authors. 
Proc. Natl Acad. Sci. USA http://doi.org/bdtd (2016) 


are made of hydrogen or 
helium, because these light 
elements float to the surface. 
By analysing light gathered by 
the Sloan Digital Sky Survey 
telescope, Kepler de Souza 
Oliveira Filho at the Federal 
University of Rio Grande do 
Sul in Porto Alegre, Brazil, 
and his colleagues identified 
a white dwarf that has been 
stripped of its light elements. 
This has left it with an 
atmosphere containing mostly 
oxygen, with traces of neon 
and magnesium. 

A violent pulse of burning 
carbon from inside the star 
ora fiery merger with a 
companion white dwarf could 
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have burned off the outer 
layers, suggest the authors. 
Science 352, 67-69 (2016) 


Enzymes help to 
regrow nerves 


A newly uncovered cell- 
signalling pathway could 
broaden strategies for 
repairing injured nerves. 
Adult neurons in the central 
nervous system (CNS) cannot 
regenerate damaged axons, 
the branches that conduct 
electrical signals. Researchers 
have previously managed to 
induce axon regeneration ina 
mouse retina by deleting a gene 
called Pten, finding that the 
process activated the protein 
complex mTORC1. Bo Chen 
at the Yale University School 
of Medicine in New Haven, 
Connecticut, and his colleagues 
have now identified a second 
regeneration pathway. They 
found that deleting Pten in 
mouse retinal neurons also 
inactivates the metabolic 
enzyme GSK36, and that 
deleting this enzyme in mice 
with optic-nerve injuries 
prompted the axons to regrow. 
The authors suggest that 
combined manipulation of the 
two Pten pathways could lead 
to treatments to repair injured 
axons in the CNS. 
eLife 5,e11903 (2016) 


Hunting habits of 
wild dogs tracked 


African wild dogs that live 
in woodland eschew the 
collaborative long-distance 
pursuit of prey used by their 
relatives on grass plains. 

Alan Wilson and Tatjana 
Hubel at the Royal Veterinary 
College in Hatfield, UK, and 
their colleagues tracked a 
pack of six wild dogs (Lycaon 
pictus) in Botswana’s woodland 
savannah, where most 
populations of this endangered 
animal now live. They found 
that the animals used multiple 
short-distance attacks in their 
attempts to capture prey. 
Unlike in dog packs on the 


plains, there was no observed 
cooperation between the 
dogs other than travelling 
together and sharing their 
kills. Although only 15.5% of 
individual attacks resulted in 
akill, sharing of food ensured 
that all animals ate regularly. 
In another paper, the same 
team modelled the energy use 
and benefits of this hunting 
strategy. Whereas the low-cost, 
short-distance chase of wild 
dogs is more costly per kill 
than for individual hunting by 
cheetahs (Acinonyx jubatus), 
this is outweighed by the 
sharing of prey. The dogs might 
be better equipped to deal with 
different habitats than was 
thought, the authors say. 
Nature Commun. 7, 11033 
(2016); Nature Commun. 7, 
11034 (2016) 


Prostate cancer 
under attack 


A nuclear-receptor protein 
called ROR-y has a key role in 
making prostate cancer deadly. 
In many cases of prostate 
cancer, the gene that encodes 
the androgen receptor 
becomes overactive and 
drives tumour progression, 
leading to fatal tumours that 
are resistant to castration 
therapy and contain high levels 
of ROR-y. To understand its 
role, Hong-Wu Chen at the 
University of California, Davis, 
Yong Xu at the Guangzhou 
Institutes of Biomedicine 
and Health, China, and their 
team suppressed ROR-y in 
cancerous cell lines and in 
human tumours transplanted 
into mice. This suppression 
blocked expression of the 
androgen-receptor gene and 
inhibited tumour growth, 
suggesting that ROR-y isa 
crucial component of the 
castration-resistant tumours. 
Current therapies target 
androgen-receptor levels, but 
mutations in tumours often 
make them resistant to such 
treatment. Targeting ROR-y 
could help patients to fight 
advanced prostate cancer. 
Nature Med. http://doi.org/bdtk 
(2016) 
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ARCHAEOLOGY 


Neanderthal 
uncovered 


With the help ofa protein- 
fingerprinting technique, 
researchers have identified 
one small chunk of bone out 
of thousands as belonging to a 
Neanderthal. 

Bone fragments from 
archaeological sites are 
often too small to identify 
the species of origin. 
Samantha Brown at the 
University of Oxford, UK, 
and her colleagues analysed 
2,315 such pieces from the 
Denisova Cave in southern 
Siberia — where Denisovans 
(an archaic human group) 
were first identified. 

The team extracted proteins 
from each fragment, finding 
human collagen protein in 
only one. Mitochondrial DNA 
sequencing and radiocarbon 
dating then showed that 
this 1.7-gram piece was 
from a Neanderthal — nota 
Denisovan — living more than 
50,000 years ago. 

Sci. Rep. 6, 23559 (2016) 


Fossil snake 
colours revealed 


Fresh evidence of coloration 
in ancient animals has been 
discovered in the 10-million- 
year-old fossilized remains of 
asnake. 

The colours of many extinct 
animals, including early 
birds, reptiles, mammals 
and amphibians, have been 
reconstructed through 
evidence of melanin pigments 
preserved in fossils. Maria 
McNamara at University 
College Cork, Ireland, and 
her colleagues now report 
evidence of coloration created 
by a different type of pigment 
called a carotenoid, preserved 
in minerals that replaced the 
tissues during fossilization. 

The team discovered 
pigment cells, called 
chromatophores, that would 
have contained carotenoids 
and light-reflecting granules in 
the skin of the fossilized snake, 


which was discovered near 
Teruel in Spain. The findings 
suggest that the animal had 

a pale belly and a green back 
and sides, with brown-black 
and yellow-green blotches — 
possibly a form of camouflage. 
The coloration could also have 
been a way to signal to other 
snakes. 

Curr. Biol. http://doi.org/bdxx 
(2016) 


Black-hole 
crackle-and-pop 


Gravitational ‘noise’ could be 
easier to detect than previously 
thought. 

Gravitational waves 
were first observed in 2015 
by the Advanced Laser 
Interferometer Gravitational- 
Wave Observatory (LIGO) 
detectors (pictured). This 
demonstrated the existence 
of binary black holes and 
suggested that these are 
more abundant than some 
previous estimates indicated. 
Tania Regimbau, at the Cote 
dAzur Observatory in Nice, 
France, and her colleagues 
from the LIGO/Virgo 
collaboration calculated the 
odds of detecting a constant 
background noise made by 
similar black-hole mergers 
across the observable Universe 
that are too distant to be 
detected individually. They 
find that LIGO and Virgo 
could potentially detect this 
background when they reach 
peak sensitivity in about four 
years time. 
Phys. Rev. Lett. 116, 131102 
(2016) 
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SEVEN DAYS nescnss 


Fraud punished 


A Parkinson's disease 
researcher in Australia pleaded 
guilty to research fraud and was 
handed a two-year suspended 
prison sentence by a court in 
Brisbane on 31 March. Bruce 
Murdoch, formerly of the 
University of Queensland in 
Brisbane, was found to have 
falsified results published in the 
European Journal of Neurology 
in 2011; three of his papers have 
been retracted. In a statement 
to the blog Retraction Watch, 
University of Queensland 
vice-chancellor Peter Hoj 

said that the university 

had reimbursed around 
Aus$175,000 (US$132,000) to 
funding bodies associated with 
Murdoch's work. 


EVENTS 


Emergency over 

The Ebola outbreak that has 
killed more than 11,000 people 
in Guinea, Liberia and Sierra 
Leone is no longer a public- 
health emergency, said the 
World Health Organization 
(WHO) in Geneva, 
Switzerland, on 29 March. 
The WHO noted that the 
virus is now unlikely to spread 
internationally, and that the 
affected countries have the 
capacity to deal with new 
cases. A campaign in Guinea 
last month administered an 
experimental Ebola vaccine 

to nearly 800 people who may 
have come into contact with 

8 individuals with the virus. 
Liberia has recorded two new 
cases since the announcement: 
a woman who died on 

31 March and her five-year- 
old son. 


ET search starts 

The SETI Institute in Mountain 
View, California, has kicked 
offa search for signals from 
extraterrestrial civilizations 
that might be living on 


Ice wall to stem Fukushima leak 


The Tokyo Electric Power Company (TEPCO) 
on 31 March began freezing the soil surrounding 
reactors 1 to 4 of the disaster-stricken Fukushima 
Daiichi nuclear power plant. A refrigeration 
system (pictured) is creating a 30-metre deep, 
1.5-kilometre-long wall of frozen ground that 
aims to stop groundwater from flowing under 


planets orbiting any of the 
20,000 nearest red dwarf stars. 
Red dwarfs are dimmer and 
cooler than the Sun, but they 
make up the bulk of stars in the 
Galaxy, increasing the odds of 
finding life there. The two- 
year search will be conducted 
at the Allen Telescope Array 
in northern California, the 
institute announced on 

30 March. 


China’s Go 


A team of Chinese scientists 
plans to challenge Google 
DeepMind’s Go-playing 
artificial-intelligence algorithm 
with its own program by the 
end of 2016, Chinese state news 
has reported. The DeepMind 
program, known as AlphaGo, 
beat a leading human player, 
South Korea's Lee Sedol, by 
four games to one in March. 
Reporting from an event 
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organized by the Chinese Go 
Association and the Chinese 
Association for Artificial 
Intelligence, Shanghai Securities 
News said on 31 March that a 
team from China will issue the 
challenge by the end of the year. 


Science in space 


Two scientific payloads 
travelled to the edge of space 
on 2 April in the latest test 

of Blue Origin’s commercial 
space vehicle, New Shepard. 
Amazon founder Jeff Bezos’s 
reusable spacecraft took off 
from a Texas launch site, flew 
to an altitude of 103 kilometres 
and successfully landed 

11 minutes later. On board 
was an experiment from the 
University of Central Florida 
in Orlando investigating 
how dust and rubble settle in 
microgravity, and the ‘box of 
rocks experiment from the 
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the plant and carrying radioactive isotopes into 
the sea. More than 300 tonnes of water per day 
are pumped into the damaged reactors to stop 
their cores from overheating and contributing 
to the radioactivity leak. TEPCO expects that it 
will take months for the ¥35-billion (US$316- 
million) project to seal the zone. 


Southwest Research Institute 
in San Antonio, Texas, to work 
out how regolith forms on 
asteroids. 


} RESEARCH 
Crater drilling 


A two-month expedition to 
drill into the 200-kilometre- 
wide Chicxulub crater, which 
formed 66 million years ago 
when an enormous asteroid 
smashed into the planet, began 
on 1 April. The aftermath of the 
impact obliterated most life on 
Earth, including the dinosaurs. 
From a drill-ship off the coast 
of Yucatan, Mexico, researchers 
will start to penetrate one of 
Chicxulubs most striking 
features — its ‘peak ring, a 
circle of mountains within the 
crater floor. Scientists have 

yet to fully explain how peak 
rings form, even though they 
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are common in big impact 
craters across the Solar System. 
See go.nature.com/pgxb18 for 
more. 


Gorilla decline 


Numbers of the largest primate 
on the planet, Grauer’s gorilla 
(Gorilla beringei graueri), 

have plummeted since 1995, 
according to a report from the 
Wildlife Conservation Society 
(WCS). The report, published 
on 4 April, says that the 
numbers have dropped from 
an estimated 17,000 in 1995 to 
3,800 today, a 77% decrease. 
Grauer’s gorillas live in the wild 
only in the eastern Democratic 
Republic of the Congo, and the 
WCGS report blames the decline 
on illegal hunting around 
mining sites, the civil war in the 
country from 1996 to 2003, and 
habitat loss. 


Nuclear security 


More than 50 countries, most 
represented by their heads 

of state, made a variety of 
commitments intended to 
prevent nuclear terrorism at 
the conclusion ofa nuclear 
summit in Washington DC 
on 1 April. The meeting was 
the fourth biennial summit in 
a process initiated in 2009 by 
US President Barack Obama 
(pictured with Canadian 
Prime Minister Justin 
Trudeau). Much of the focus 
has been on reducing civilian 


TREND WATCH 


More of the global population is 
now obese than is underweight, 
according to a study of 

186 countries from 1975 to 2014 
(see Lancet 387, 1377-1396; 
2016). The proportion of obese 
men more than tripled and 

that of obese women more 

than doubled during that 
period. Many people are still 
underweight in the world’s 
poorest regions, particularly in 
Asia and Africa. But the global 
average weight of a person grew 
by 1.5 kilograms each decade. See 
go.nature.com/yslifh for more. 


stocks of highly enriched 
uranium at research reactors. 
At least 28 reactors have 

been shut down or converted 
to low-enriched uranium 
since 2009, but challenges 
remain in converting 11 high- 
performance research reactors. 
See page 5 for more. 


Patent pledge 
Drugmaker GlaxoSmithKline 
(GSK) has announced plans to 
improve access to its medicines 
in the world’s poorest countries. 
The company said on 31 March 
that it would stop filing drug 
patents in many developing 
countries. That means that 
generic manufacturers in 

those nations would be able 

to supply copycat versions of 
GSK’s drugs without worrying 
about lawsuits. GSK also 
signalled that it intended to 
improve access to low-cost 
drugs that can help to address 
the growing burden of cancer 
in the developing world. 


Public-health advocates have 
embraced the news and are 
urging other drug companies 
to follow suit. See go.nature. 
com/nghggj for more. 


FACILITIES 


Laser beam added 
The Linac Coherent Light 
Source (LCLS), the world’s 
brightest X-ray free-electron 
laser, began a US$1-billion 
construction project on 4 April 
to adda second beam. LCLS-II, 
based at the SLAC National 
Accelerator Laboratory in 
Menlo Park, California, will 
accelerate electrons through 
superconducting niobium 
cavities to produce X-ray 
pulses 10,000 times more 
concentrated and firing 

8,000 times faster than X-rays 
produced by the $414-million 
LCLS, which started operations 
in 2009. This will enable it to 
image processes that occur 

on smaller scales and faster 
timescales. Construction will 
last until the early 2020s. 


OBESITY RISE OVER 40 YEARS 


Data from 1975 to 2014 show that obesity has risen dramatically, 
more than tripling in men and more than doubling in women. 
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SEVEN DAYS | THIS WEEK | 


11-13 APRIL 

The 43rd session of 
the Intergovernmental 
Panel on Climate 
Change convenes in 
Nairobi. 
go.nature.com/hdodfh 


12-14 APRIL 
London hosts the 2016 
Obesity Summit. 
go.nature.com/kzhswf 


12-15 APRIL 

Water across the 
Universe and its origins 
will be discussed 

in Noordwijk, the 
Netherlands. 
go.nature.com/Incjsb 


Career boost 

Four philanthropic 
organizations have created 

an international research 
programme focused on early- 
career scientists. Announced 
on 29 March, the International 
Research Scholars Program 
will select up to 50 members 
who are not originally from one 
of the G7 countries, but have 
trained in the United States or 
the United Kingdom for at least 
one year. Awardees will each 
receive a total of US$650,000 
over five years. The sponsors 
are: the Howard Hughes 
Medical Institute in Chevy 
Chase, Maryland; the Bill & 
Melinda Gates Foundation 

in Seattle, Washington; the 
London-based Wellcome 
Trust; and Lisbon’s Calouste 
Gulbenkian Foundation. 


CORRECTION 

The item ‘Boaty McBoattface’ 
(Nature 531, 419; 2016) 
wrongly described NERC 

as the ‘National’ instead of 
the Natural Environment 
Research Council. 


> NATURE.COM 
For daily news updates see: 
WWW.nature.com/news 
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US Republican presidential candidate Donald Trump has proposed building a wall between the United States and Mexico. 
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Trump’s immigration stance 
stokes fears for science 


Rhetoric in US presidential campaign concerns researchers — particularly Muslims. 


BY HEIDI LEDFORD 


azi Nalim has lived in the United States 
R= 30 years. An engineer at Indiana 
University-Purdue University Indianap- 
olis, he often travels around the world to recruit 
science and engineering students to his univer- 
sity. But last week, on the cusp ofa recruitment 
trip to India, he hesitated when asked whether 
he would still encourage foreign, Muslim stu- 
dents to work or study in the United States. 
“I would still say the opportunity for doing 


cutting-edge science here is unmatched,” said 
Nalim, who is Muslim. “Where I think I would 
caution people to think more carefully is longer 
term: where would they want to live and raise 
a family? That's a harder question to answer.’ 
For Nalim and others, the roots of such 
concerns are apparent. In December, US pres- 
idential candidate Donald Trump, who has 
campaigned against immigration, boasted 
that he would ban Muslims from entering the 
country if elected. (On 30 March, Trump — 
now the Republican front runner — said that 
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he would make exceptions for some Muslims, 
notably his wealthy Muslim friends.) 

Science advocates worry that Trump's 
broader anti-immigration stance could pose 
a threat to US research dominance. Roughly 
5% of all students in the United States hail 
from other countries — including more than 
380,000 people studying science, engineering, 
technology or mathematics. “We've always 
been a nation which has welcomed scientific 
brainpower from other countries,” says Mary 
Woolley, president of Research! America, a 
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> science-advocacy group in Alexandria, Vir- 
ginia. “We don't want that to turn around now.’ 

Scientific issues have scarcely been men- 
tioned on the campaign trail so far. Hillary 
Clinton, the Democratic front runner, has 
pledged to boost support for research into Alz- 
heimer’s disease, and has pushed back against 
Trump’s anti-immigration and anti-Muslim 
stance. When she was a senator, Clinton backed 
health and research-related bills, and as first 
lady to former president Bill Clinton, she advo- 
cated for research on women’ health. 

Trump is a wealthy real-estate mogul with no 
political legacy to mine for clues as to his scien- 
tific opinions. In the course of the campaign, he 
has linked autism to childhood vaccines, and 
dismissed climate change. (“It’s called weather,’ 
he said.) In October, conservative radio host 
Michael Savage suggested on air that if elected, 
Trump should appoint him as head of the US 
National Institutes of Health (NIH). “Well, you 
know youd get common sense if that were the 
case, that I can tell you,’ Trump replied, during 
the light-hearted conversation. “Because I hear 
so much about the NIH, and it’s terrible.” 

With little more than this to go on, advo- 
cates of science funding are worried. “It feels 
like there’s a lot of cynicism toward science 
and scientists, and that’s concerning,” says 
Benjamin Corb, public-affairs director at 


the American Society for Biochemistry and 

Molecular Biology in Rockville, Maryland. 
Trump’s position on immigration is clearer. 
He frequently boasts that if elected, he would 
build a wall along the border with Mexico — 
and force Mexico to pay for it — which has 
earned him both supporters and derision. A 
President Trump 


“There’salotof could bode ill for 
cynicism toward _ long-running efforts 
science and to boost the number 


scientists.” of foreign profes- 
sionals working in 
the United States on visas for highly skilled 
workers, known as H-1Bs. But Trump’s state- 
ments regarding H-1B visas have been difficult 
to parse. At times, he has advocated bringing 
skilled workers into the country; at others, he 
has said that the H-1B programme is too often 
abused and should be restricted. 

Such statements worry Brad Hayes, a com- 
puter scientist at the Massachusetts Institute of 
Technology in Cambridge. Hayes is an US citi- 
zen, but says that some of his most outstanding 
colleagues are not. “A lot of them want to end 
up here after they get their PhDs, but now that’s 
in doubt, he says. “We absolutely want these 
people to stay. If they get lumped in with this 
‘close our borders, keep everybody out; we're 
doing ourselves a disservice.” 


Hayes inadvertently cast a spotlight on the 
simplicity of Trump’s rhetoric when he decided 
to use a neural network to model Trump's 
noticeably repetitive and simplistic speech 
patterns. He has been posting the results — 
computer-generated parody quotes based on 
Trump’s campaign speeches — on Twitter 
using the handle @DeepDrumpf. (Trump's 
ancestral name, Drumpf, was changed by the 
family several generations ago.) 

“We're going to build the wall? says one tweet, 
in reference to Trump’s Mexico plan. Hayes says 
that the project was only meant to be fun, but it 
ended up making a point. “A lot of the rhetoric 
that’s being used is fairly content-light” 

But that rhetoric is having an effect, says 
Ehab Abouheif, a developmental biologist at 
McGill University in Montreal, Canada, who 
is Muslim. On a recent trip to be interviewed 
for a position in the United States, recruiters’ 
“constant question was, ‘Are you really sure you 
would want to come?’” he says. “My scientist 
colleagues are really scared.” 

To Abouheif, who fondly remembers com- 
pleting his PhD and his postdoc in the United 
States, the current climate is surreal. “If you are 
trying to stop Muslims from coming in, it means 
that the ones who are there already are not going 
to feel comfortable either, he says. “It would be 
ashame to alienate this big swathe of society.” m 


COSMOLOGY 


Controversial dark-matter 
claim faces ultimate test 


Multiple teams finally have the material they need to repeat enigmatic experiment. 


BY DAVIDE CASTELVECCHI 


matter research: a claimed detection that is 

hard to believe, impossible to confirm and 
surprisingly difficult to explain away. Now, 
four instruments that will use the same type of 
detector as the collaboration behind the claim 
are in the works or poised to go online. Within 
three years, the experiments will be able to 
either confirm the existence of dark matter — 
or rule the claim out once and for all, say the 
physicists who work on them. 

“This will get resolved,” says Frank Calaprice 
of Princeton University in New Jersey, who 
leads one of the efforts. 

The original claim comes from the DAMA 
collaboration, whose detector sits in a labora- 
tory deep under the Gran Sasso Massif, east of 
Rome. For more than a decade, it has reported 


L is the elephant in the room for dark- 
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overwhelming evidence! for dark matter, an 
invisible substance thought to bind galaxies 
together through its gravitational attraction. 
The first of the new detectors to go online, in 
South Korea, is due to start taking data in a few 
weeks. The others will follow over the next few 
years in Spain, Australia and, again, Gran Sasso. 
All will use sodium iodide crystals to detect 
dark matter, which no full-scale experiment 
apart from DAMAs has done previously. 

Scientists have substantial evidence that 
dark matter exists and is at least five times as 
abundant as ordinary matter. But its nature 
remains a mystery. The leading hypothesis is 
that at least some of its mass is composed of 
weakly interacting massive particles (WIMPs), 
which on Earth should occasionally bump into 
an atomic nucleus. 

DAMA’s sodium iodide crystals should 
produce a flash of light if this happens in the 
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detector. And although natural radioactivity 
also produces such flashes, DAMA’s claim to 
have detected WIMPs, first made in 1998, rests 
on the fact that the number of flashes produced 
per day has varied with the seasons. 

This, they say, is exactly what is expected 
if the signal is produced by WIMPs that rain 
down on Earth as the Solar System moves 
through the Milky Way’s dark-matter halo”. In 
this scenario, the number of particles crossing 
Earth should peak when the planet’s orbital 
motion lines up with that of the Sun, in early 
June, and should hit a low when its motion 
works against the Sun’s, in early December. 

There is one big problem. “Tf it’s really 
dark matter, many other experiments should 
have seen it already,” says Thomas Schwetz- 
Mangold, a theoretical physicist at the Karlsruhe 
Institute of Technology in Germany — and 
none has. But at the same time, all attempts to 
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find weaknesses in the DAMA 
experiment, such as environmen- 
tal effects that the researchers had 
not taken into account, have failed. 
“The modulation signal is there,” 
says Kaixuan Ni at the University 
of California, San Diego, who 
works on a dark-matter experi- 
ment called XENONIT. “But how 
to interpret that signal — whether 
it's from dark matter or something 
else — is not clear” 

No other full-scale experi- 
ment has used sodium iodide in 
its detector, although the Korea 
Invisible Mass Search (KIMS), in 
South Korea, used caesium iodide. 
So the possibility remains that dark 
matter interacts with sodium ina 
different way to other elements. 
“Not until someone has turned 
on a detector made of the same 
material will you grow convinced 
that nothing is there,” says Juan 
Collar at the University of Chicago, 
Illinois, who has worked on several 
dark-matter experiments. 

Many have found it challenging 
to grow sodium iodide crystals 
with the required purity. Contam- 
ination by potassium, which has 
a naturally occurring radioactive 
isotope, is a particular problem. 

But now three dark-matter- 
hunting teams — KIMS; DM-Ice, run from Yale 
University in New Haven, Connecticut; and 
ANAIS, at the University of Zaragoza, Spain — 
have each obtained crystals with about twice the 
level of background radioactivity of DAMAs. 
That is pure enough to test its results, they say. 

The KIMS and DM-Ice teams have built a 
sodium iodide detector together at Yangyang 
Underground Laboratory, 160 kilometres 
east of Seoul. This instrument uses an ‘active 
veto’ sensor that will enable it to separate the 
dark-matter signal from the noise better than 
DAMA does, says Yeongduk Kim, the direc- 
tor of South Korea’s Center for Underground 
Physics in Daejeon, which manages KIMS. 

ANAIS is building a similar detector in the 
Canfranc Underground Laboratory in the 
Spanish Pyrenees. Together, KIMS/DM-Ice 
and ANAIS will have about 200 kilograms of 
sodium iodide, and they will pool their data. 


TOP STORY 


MORE 
ONLINE 


< 
rs ~ => 4 


The DAMA team uses sodium iodide housed in copper to hunt for dark matter. 


That is comparable to DAMAs 250 kilograms, 
enabling them to catch a similar number of 
WIMPs, they say. Even though the newer 
detectors will have higher levels of background 
noise, it should still be possible to either falsify 
or reproduce the very large DAMA signal, says 
Reina Maruyama of Yale, who leads DM-Ice. 

But Calaprice argues that high purity is more 
important than mass. He and his collaborators 
have developed a technique to lower contami- 
nation, and in January announced that they 
were the first to obtain crystals purer than 
DAMASs. He expects to reduce the background 
levels further, to one-tenth of DAMAs. 

The project, SABRE (Sodium-iodide with 
Active Background Rejection), will put one 
detector at Gran Sasso and the other at the 
Stawell Underground Physics Laboratory, 
which is being built in a gold mine in Victoria, 
Australia. SABRE will also use a sensor to pull 
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out the dark-matter signal from 
noise, and will have a total mass of 
50 kilograms. 

SABRE should complete its 
research and development stage 
in about a year, and will build 
its detectors soon after that, says 
Calaprice. It will then make its 
technology available to other labs 
— something that DAMA did not 
do. And having twin detectors in 
both the Northern and South- 
ern hemispheres could clarify 
whether environmental effects 
could have mimicked dark mat- 
ter’s seasonality in DAMAs results 
— if the signal is from WIMPs, 
then both detectors should see 
peaks at the same time. 

DAMA will wait at least until 
2017 to release its latest results, 
says spokesperson Rita Bernabei 
of the University of Rome Tor Ver- 
gata. She is not holding her breath 
about the upcoming sodium 
iodide detectors. “Our results have 
already been verified in countless 
cross-checks in 14 annual cycles, 
so we have no reason to get excited 
about what others may do,” she 
says. If other experiments do not 
see the annual modulation, she 
adds, her collaboration will con- 
clude that they do not yet have 
sufficient sensitivity. 

Could the teams prove DAMA right? “I 
was unwilling to believe the DAMA results or 
even take them seriously at first,” says Kather- 
ine Freese, a theoretical astroparticle physicist 
at the University of Michigan in Ann Arbor, 
who with her collaborators first proposed 
the seasonal modulation technique used by 
DAMA”. But, in part because of a lack of any 
other explanation for their signal, she is now 
more hopeful. The fact that many have tried 
and failed to repeat DAMAs experiment shows 
that it is not easy, says Elisabetta Barberio at the 
University of Melbourne, who leads the Austral- 
ian arm of SABRE. “The more one looks into 
their experiment, the more one realizes that it 
is very well done.” m 
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Researchers are trying to establish whether Zika virus causes the birth defect microcephaly. 


PUBLIC HEALTH 


Zika highlights role of 
fetal-tissue research 


Controversial tissue studies could prove crucial to probing 
link between virus and birth defects. 


BY ERIKA CHECK HAYDEN 


protein that helps Zika virus infect 
A= skin cells might also give the virus 

access to stem cells that make brain 
cells, suggests a study carried out on donated 
human fetal tissue. 

The result — published on 30 March in 
Cell Stem Cell' — is part of a growing body 
of research that seeks to determine how Zika 
might cause birth defects, but that requires a 
type of tissue that is increasingly controversial 
for researchers in the United States. 

Recent advances in neuroscience and cell 
technology have given hints as to why some 
babies born to Zika-infected mothers have 
abnormally small heads — a condition called 
microcephaly — and other birth defects. But 
to fully understand what is happening in the 
womb, some scientists say that they need to 
study tissue from fetuses, which can be donated 
by couples who terminate pregnancies. 

Researchers already knew that a protein 
called AXL enabled Zika to enter human skin 
cells. Now, Arnold Kriegstein, a neuroscientist 
at the University of California, San Francisco, 
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(UCSF) and his colleagues show that the 
protein is also made by cells in the fetus that 
form the eyes and the brain. AXL could pro- 
vide a means for Zika virus to infect these cells. 
Two other studies published this month” 
showed that Zika specifically targets and 
kills neuron-forming 


“Manyfewer cells, including those in 
people are organoids — brain-like 
willing to structures derived from 
donate, and reprogrammed human 
it’s slowing us skin cells. These stud- 
down.” ies suggest that Zika 


causes microcephaly by 
damaging fetal cells that make the brain, says 
neuroscientist Patricia Pestana Garcez of the 
Federal University of Rio de Janeiro, Brazil, 
who led one of the studies. 

Kriegstein’s study used fetal tissue donated 
by patients treated at UCSF medical facilities. 
But such material may get harder to come by, 
because the collection and use of fetal cells is 
under renewed scrutiny in the United States. 
Last July, an anti-abortion group called the 
Center for Medical Progress in Irvine, Cali- 
fornia, released video of employees from 
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the non-profit health-care provider Planned ¢ 
Parenthood discussing the sale of fetal tissue 
from abortions for research. Members of the 
US House of Representatives are now investi- 
gating the use of fetal tissue in research. 

Scientists in the United States worry that the 
controversy could hamper essential research 
on the Zika virus. “Many fewer people are will- 
ing to donate, and it’s slowing us down,” says 
Susan Fisher, a stem-cell and developmental 
biologist at UCSF. 

Fisher is studying how Zika virus is 
transmitted from mother to baby. She has 
found AXL in fetal cells called tropho- 
blasts that anchor the placenta, which sup- 
plies a fetus with blood and nutrients, to the 
mother’s uterus. These cells are known to 
transmit infections such as cytomegalovirus 
from mother to baby. “This suggests that the 
placenta is extremely capable of transmitting 
Zika,” says Fisher, whose studies rely on fetal 
tissue donated from terminated and full-term 
pregnancies. 

Carolyn Coyne, a virologist at the University 
of Pittsburgh in Pennsylvania, says that fetal 
tissue is particularly crucial for studies of Zika 
because the virus seems to be able to harm a 
fetus throughout pregnancy’. “It is absolutely 
essential to study Zika infection in human 
fetal tissue,” says Coyne. “These types of stud- 
ies need to extend to all stages of pregnancy.’ 

Because abortion is illegal or highly 
restricted in many Latin American countries, 
laboratory research on neural development in 
the regions hit hardest by Zika relies mainly 
on other types of human tissue, such as orga- 
noids. Researchers in Brazil, for example, are 
studying the lethality of different Zika viruses 
in neurons and organoids derived from cord 
blood. 

Both Fisher and Kriegstein are planning 
further studies to test how Zika infects devel- 
oping brain and placental cells. They argue that 
such studies are crucial to establish why the 
virus damages babies’ brains, and whether this 
can be prevented. 

The scientists will also use organoids and 
animal models, but they note that neither of 
these is a perfect substitute for human fetal 
tissue. For instance, researchers aren't sure 
how faithfully the growth of brain organoids 
replicates human brain development. “It'll be 
important to demonstrate in human tissue 
exactly how the virus is creating the pattern of 
damage that is emerging,’ Kriegstein says. “In 
situations like this, where there’s considerable 
time pressure to try to unravel what’s going on 
and to protect the developing human brain, it’s 
especially important? = 
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Snow sensors seek best way 
to track the white stuff 


Airborne experiments aim to fillin the blanks of global water resources as the climate changes. 


BY ALEXANDRA WITZE 


hen it comes to monitoring the 

world’s frozen places, ice gets 

most of the love. Satellites such as 
CryoSat-2, run by the European Space Agency 
(ESA), measure minute changes in Earth’s 
melting ice sheets. Now another group of cryo- 
spheric scientists hopes to get in on the action 
— by monitoring not ice, but snow. 

Snow measurements are crucial for under- 
standing the world’s water resources. But 
observations lag behind those of ice, mainly 
because remote sensing doesn’t work consist- 
ently across all snowy environments. From 
mountains to prairies to tundra, the sheer 
variety of landscapes has frustrated efforts to 
produce high-resolution, worldwide maps. 

“The biggest hole in our knowledge of 
the global water budget is snow,’ says Jeffrey 
Deems, a research scientist at the US National 
Snow and Ice Data Center in Boulder, Colo- 
rado. “We really have no idea how much is out 
there?” 

Last week, at a workshop in Seattle, Wash- 
ington, Deems and his colleagues settled 
on a plan to change that, when they laid out 
details for a multiyear NASA field campaign 
scheduled to begin in September. The SnowEx 
experiment will fly aeroplanes carrying a 
combination of remote-sensing instruments 
— including radar, laser altimeters (lidar) and 
multispectral imagers — over snowy land- 
scapes. The goal is to see which techniques 
work best for studying snow, and to combine 
those in a design for a snow-sensing satellite. 

Snow information is becoming more crucial 
as the climate changes, says Matthew Sturm, a 
snow scientist at the University of Alaska Fair- 
banks. More than 1.2 billion people worldwide 
rely on mountain snowpacks for water — but 
in many areas, snowfall may decrease in the 
future (J. S. Mankin et al. Environ. Res. Lett. 
10, 114016; 2015). In California, for example, 
the ongoing drought means that water manag- 
ers are increasingly eager for any information 
about how much runoff to expect, and when, 
throughout the summer. 

In Alaska, changing snow cover affects how 
fast permafrost thaws, destabilizing the land- 
scape. And as Arctic sea-ice cover shrinks, so 
too does its protective snow cover, leading to 
feedback loops of increasing ice destruction. 


EYES ON THE SNOW 


Remote-sensing measurements could finally let scientists monitor Earth’s snow resources — which provide 
drinking water for billions of people. NASA is planning to test various combinations of sensors to see which 
do best at quantifying how much snow lies on a landscape and how quickly it is likely to melt away. 


Hyperspectral/ 

multispectral imager 
To measure snow cover, 
albedo and grain size 
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Current satellites have limited ability to 
track these changes. ESA’s now-concluded 
GlobSnow project used satellite microwave 
data to map global ‘snow water equivalent’ — 
the crucial estimate of how much water is con- 
tained in the world’s snowpacks, calculated 
by multiplying snow depth by density. But 
GlobSnow’s maps, with pixels 25 kilometres 
on each side, are too low-resolution for precise 
estimates in individual watersheds. 

In the past few years, NASA and ESA have 
each rejected proposals for more-detailed 
snow- observing satellites. Both missions 
would have used radar to measure snow depth 
and calculate snow water equivalent, and both 
were doomed by doubts that researchers could 
reliably extract that information from the type 
of radar proposed. “For a long time we were on 
a quest for a single sensor,” Sturm says. “The 
snow’s just not that simple.” 

Now the focus is shifting to testing multiple 
sensors simultaneously, to see which combina- 
tion works best. The first SnowEx flights will 
carry lidar instruments and several types of 
radar (see ‘Eyes on the snow’) over the Sierra 
Nevada or Rocky Mountains in western North 
America. Each will measure snow cover by 
monitoring how lidar or radar pulses bounce 
off the ground and reflect back to the plane. The 
instruments will include new technologies that, 
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Lidar (laser altimeter) 
To calculate snow depth 


Passive microwave detector 
To calculate snow depth 


Radar 
To calculate snow depth 


Passive visible light/ 
infrared detector 
To measure snow 
cover and albedo 


when used together, may avoid the problems of 
the rejected satellite radar, says Edward Kim, 
lead scientist for SnowEx at NASA’s Goddard 
Space Flight Center in Greenbelt, Maryland. 

One method is already becoming enor- 
mously popular in California and other parts of 
the western United States. For the past several 
years, NASA has flown a lidar-equipped plane 
called the Airborne Snow Observatory (ASO) 
over several western watersheds. The observa- 
tory measures the shape of the terrain in the 
summer, when there is no snow on the ground, 
and then returns throughout the winter to 
measure the changing depth of the snowpack. 

Project scientists can build up highly 
detailed maps — down to 1.5-metre resolution 
— of watersheds such as the Tuolumne River 
Basin, which supplies the city of San Francisco 
in California. Water managers use the resulting 
data to estimate how much runoff to expect in 
the spring. “We've never had that across these 
mountain basins before,” says Thomas Painter, 
who heads the ASO project at NASA's Jet Pro- 
pulsion Laboratory in Pasadena, California. 

SnowEx will build on the ASO by testing 
extra instruments. “This is the number one 
priority,” says Jessica Lundquist, a hydrolo- 
gist at the University of Washington in Seattle. 
“We need to figure out how to measure snow 
right.” a 
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| NEUROSCIENCE | 

Human Brain 
Project releases 
computing tools 


Move signals effort to benefit 
wider community. 


BY QUIRIN SCHIERMEIER & ALISON ABBOTT 


urope’s major brain-research project has 
| a set of prototype computing 

tools and called on the global neuro- 
science community to start using them. 

Release of the tools, which marks the start of 
the operational phase of the Human Brain Pro- 
ject (HBP), could help to allay concerns about 
the €1-billion (US$1.1-billion) venture’ benefits 
to the wider scientific community. But it is not 
yet clear how the platforms will resonate with 
researchers outside the project. 

“At this point, no one can say whether or not 
the research platforms will be a success,’ says 
Andreas Herz, chair of computational neuro- 
science at the Ludwig Maximilian University 
of Munich in Germany. 

The computing platforms include brain- 
simulation tools, visualization software and a 
pair of remotely accessible supercomputers that 
researchers can use to study brain processes in 
real time. Some will be freely accessible, others 
available only on the success of a peer-reviewed 
application (see go.nature.com/rlpjpz). 

The HBP’s general mission is to combine 
different types of neuroscience data to recon- 
struct the human brain computationally at dif- 
ferent scales, and to simulate it. But the project 
has attracted controversy. In 2014, around 
150 neuroscientists from outside the project 
signed a petition claiming that it was running 
offits scientific course, and pledged to boycott 
it unless their concerns were addressed. An 
independent review completed in March 2015 
stressed, among other things, that the comput- 
ing infrastructure created by the HBP must be 
useful to the wider scientific community. 

Release of the neuroscience tools is a sign 
that the HBP’s leadership accepts that the 
project needs to focus on providing concrete 
services for the neuroscience community 
at large, says Herz. But he cautions that the 
project still hinges on “logical flaws”, such 
as “the dream” that sparse recordings from 
neurons can generate dense data. 

The platforms are to be developed with 
a view to eventually becoming part of a 
permanent, pan-European research infrastruc- 
ture, say the HBP’s organizers — a step that 
would require the project to secure permanent 
funding commitments from governments. m 
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The high seas host a wealth of marine life but lack comprehensive rules to guide conservation. 


Talks aim to tame 
marine Wild West 


Nations debate how to protect biodiversity in the high seas. 


BY DANIEL CRESSEY 


Tes might host habitats of huge 
ecological importance, but two- 
thirds of the world’s oceans lie 
beyond the authority of national govern- 
ments. On 28 March, members of the 
United Nations began negotiating the first 
global treaty to impose conservation and 
sustainability on the high seas. As well as 
vastly increasing the areas that could be set 
aside to protect endangered species, a legally 
binding treaty could usher in laws on the use 
of marine organisms in the search for drugs 
and cosmetics. 

“This is something that could change the 
future of how we manage our oceans,’ says 
Elizabeth Wilson, director of international 
ocean policy at the Pew Charitable Trusts in 
Washington DC, which has advocated for 
such a treaty. 

Currently, nations can claim waters up 
to 200 nautical miles (370 kilometres) from 
their shorelines as ‘exclusive economic zones} 
everything beyond these is designated the 
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high seas (see ‘Sea change’). There are trea- 
ties that govern specific high-seas activities, 
such as mining the sea bed for minerals or 
laying cables, and agreements that regulate 
some kinds of fishing in various areas. But 
there is no comprehensive set of regulations 
on biodiversity and conservation. 

The ongoing treaty discussions, which 
are at the UN headquarters in New York, 
are the first talks of a preparatory commit- 
tee, known as Prep Com — and the result 
of years of wrangling between member 
nations. If they, and three more meetings 
before the end of 2017, are successful, a text 
could be voted on by UN members as soon 
as 2018, Wilson says. 

One goal of the UN negotiations is a 
mechanism for the creation of marine pro- 
tected areas (MPAs) on the high seas. As part 
of the UN Convention on Biological Diver- 
sity, most governments are now committed 
to designating at least 10% of the oceans as 
protected areas by 2020 — currently, around 
2% lies in MPAs. But legally, they can create 
MPAs only in their own waters, so some of 
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the most ecologically important 
areas remain out of reach. 

Moreover, in a study published 
last month, marine scientists Cal- 
lum Roberts, Bethan O’Leary and 
their colleagues at the University 
of York, UK, conclude that the 
10% target is much too low. On the 
basis of a review of 144 studies that 
assessed the adequacy of the UN 
target, they estimate that more than 
30% of the ocean must be protected 
to achieve goals such as protecting 
biodiversity and minimizing the 
risk that fish populations will col- 
lapse (B. C. O'Leary et al. Conserv. Lett. http:// 
doi.org/bdxw; 2016). But it would be near 
impossible to set aside this much-larger por- 
tion without being able to create MPAs in the 
high seas, notes Roberts. “These negotiations 
over the next two years are going to be vitally 
important,’ he says. 


BROAD APPROACH 

Others say that MPAs aren't necessarily the 
best way to protect the oceans, because they 
can lead to problems such as overfishing in 
non-protected areas. Rather than setting aside 
large areas for protection, a better approach 
would be to manage activities properly across 
the ocean, says oceanographer James Cowan 


SEA CHANGE 


Negotiations have begun to allow the creation of marine reserves not just in 
the exclusive economic zones that can be claimed by nations, but throughout 
the oceans. 


> —« 


at Louisiana State University in Baton Rouge. 

Another issue being discussed during the 
negotiations is how to govern the search for 
genetic resources — plants and animals that 
could yield products such as drugs or cosmetics. 


Exclusive economic zones | High seas 
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Bioprospecting is on the rise in the 
oceans, particularly in the high 
seas, says Glen Wright, a marine- 
policy researcher at the Institute for 
Sustainable Development and Inter- 
national Relations in Paris. 

The G77 group of develop- 
ing nations argues that genetic 
resources from the high seas are 
part of the ‘common heritage of 
mankind; so any profits from them 
should be shared among all nations. 
Other groups, including the Euro- 
pean Union would rather avoid 
such a formal status, says Wright, 
and focus instead on a practical mechanism 
for sharing the benefits. But at this stage, say 
Wright and Wilson, it is unclear what rules 
might be brought in, and what mechanisms 
might be used to enforce them. = 


CORRECTIONS 


warranted. 


In the News story ‘Clash over killer-whale captivity’ (Nature 531, 426-427; 2016), we 
erroneously referred to Jeffrey Ventre as a veterinary surgeon. He is, in fact, a physician. And in 
the story ‘Scientists say “no” to UK exit from Europe in Nature poll’ (Nature 531, 559; 2016), 
the labels numbered 2 and 4 in the graphic ‘What impact would a UK exit from the EU have 
on UK science?’ were the wrong way round. The opening paragraph also potentially implied 
that the poll was more representative of scientists in the UK and greater EU than perhaps was 
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POCKET 
PSYCHIATRY 


MOBILE MENTAL-HEALTH 
APPS HAVE EXPLODED 
ONTO THE MARKET, 

BUT FEW HAVE BEEN 
THOROUGHLY TESTED. 


BY EMILY ANTHES 


hundred programs will pop up on the screen. There are apps 

that diagnose depression (Depression Test), track moods 
(Optimism) and help people to “think more positive” (Affirmations). 
There’s Depression Cure Hypnosis (“The #1 Depression Cure Hypno- 
sis App in the App Store”), Gratitude Journal (“the easiest and most 
effective way to rewire your brain in just five minutes a day”), and 
dozens more. And that’s just for depression. There are apps pitched at 
people struggling with anxiety, schizophrenia, post-traumatic stress 


T ype depression into the Apple App Store and a list of at least a 
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disorder (PTSD), eating disorders and addiction. 

This burgeoning industry may meet an important need. 
Estimates suggest that about 29% of people will experience a 
mental disorder in their lifetime’. Data from the World Health 
Organization (WHO) show that many of those people — up to 
55% in developed countries and 85% in developing ones — are 
not getting the treatment they need. Mobile health apps could 
help to fill the gap (see “Mobilizing mental health’). Given the 
ubiquity of smartphones, apps might serve as a digital life- 
line — particularly in rural and low-income regions — put- 
ting a portable therapist in every pocket. “We can now reach 
people that up until recently were completely unreach- 
able to us,” says Dror Ben-Zeev, who directs the mHealth for 
Mental Health Program at the Dartmouth Psychiatric Research 
Center in Lebanon, New Hampshire. 


learn more about PTSD, track symptoms and set up a support 
network of friends and family members. The app also provides 
strategies for coping with overwhelming emotions; it might sug- 
gest that users distract themselves by finding a funny video on 
YouTube or lead users through visualization exercises. 

In its first three years in app stores, PTSD Coach was down- 
loaded more than 150,000 times in 86 different countries. It 
has shown promise in several small studies; in a 2014 study of 
45 veterans, more than 80% reported that the app helped them to 


“IF YOU TYPE IN ‘DEPRESSION’, 


ITS HARD TO KNOW IF THE 
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APPS THAT YOU GET BACK ARE 
HIGH QUALITY, IF THEY WORK, 
IF THEY’RE EVEN SAFE TO USE.” 


Public-health organizations have been buying into the concept. 
In its Mental Health Action Plan 2013-2020, the WHO recom- 
mended “the promotion of self-care, for instance, through the 


use of electronic and mobile health technologies.” And the UK 
National Health Service (NHS) website NHS Choices carries a 
short list of online mental-health resources, including a few apps, 
that it has formally endorsed. 

But the technology is moving a lot faster than the science. 
Although there is some evidence that empirically based, well- 
designed mental-health apps can improve outcomes for patients, 
the vast majority remain unstudied. They may or may not be 
effective, and some may even be harmful. Scientists and health 
officials are now beginning to investigate their potential benefits 
and pitfalls more thoroughly, but there is still a lot left to learn and 
little guidance for consumers. 

“If you type in ‘depression; its hard to know if the apps that 
you get back are high quality, if they work, if they’re even safe to 
use,” says John Torous, a psychiatrist at Harvard Medical School 
in Boston, Massachusetts, who chairs the American Psychiatric 
Association’s Smartphone App Evaluation Task Force. “Right now 
it almost feels like the Wild West of health care” 


APP HAPPY 

Electronic interventions are not new to psychology; there is 
robust literature showing that Internet-based cognitive behav- 
ioural therapy (CBT), a therapeutic approach that aims to change 
problematic thoughts and behaviours, can be effective for treat- 
ing conditions such as depression, anxiety and eating disorders. 
But many of these online therapeutic programmes are designed 
to be completed in lengthy sessions in front of a conventional 
computer screen. 

Smartphone apps, on the other hand, can be used on the go. 
“It's a way of people getting access to treatment that’s flexible and 
fits in with their lifestyle and also deals with the issues around 
stigma — if people are not quite ready to maybe go and see their 
doctor, then it might bea first step to seeking help,’ says Jen Martin, 
the programme manager at MindTech, a national centre funded by 
the United Kingdom's National Institute for Health Research and 
devoted to developing and testing new mental-health technologies. 

One of the best-known publicly available apps was devised to 
meet that desire for flexibility. In 2010, US government psycholo- 
gists conducting focus groups with military veterans who had 
PTSD learned that they wanted a tool they could use whenever 
their symptoms flared up. “They wanted something that they 
could use in the moment when the distress was rising — so when 
they were in line at the supermarket,” says Eric Kuhn, a clinical 
psychologist and the mobile apps lead at the US Department of 
Veterans Affairs’ National Center for PTSD. 

The department joined up with the US Department of Defense 
to create PTSD Coach, a free smartphone app released in early 
2011. Anyone who has experienced trauma can use the app to 


track and manage their symptoms and provided practical solu- 
tions to their problems”. More results are expected soon. Kuhn 
and his colleagues recently completed a 120-person randomized 
trial of the app, anda Dutch team is currently analysing data from 
a 1,300-patient trial on a similar app called SUPPORT Coach. 

Smartphone apps can also interact with users proactively, 
pinging them to ask about their moods, thoughts and overall 
well-being. Ben-Zeev created one called FOCUS, which is geared 
towards patients with schizophrenia. Several times a day, the app 
prompts users to answer questions such as “How well did you 
sleep last night?” or “How has your mood been today?” If users 
report that they slept poorly, or have been feeling anxious, the app 
will suggest strategies for tackling that problem, such as limiting 
caffeine intake or doing some deep-breathing exercises. 

Some apps help people to stay connected to health-care 
professionals, too. ClinTouch, a psychiatric-symptom-assessment 
app designed by researchers at the University of Manchester, UK, 
analyses users responses for signs that they may be experiencing 
a relapse; it can even notify a clinical-care team. 

Small feasibility studies — which are generally designed to 
determine whether an intervention is practical, but do not neces- 
sarily evaluate its efficacy — have shown that patients use and like 
both apps, and a 2014 study found that those who used FOCUS 
for a month experienced a reduction in psychotic symptoms and 
depression’. FOCUS and ClinTouch are both now being evaluated 
in randomized, controlled trials. 

Some researchers see opportunities in the data that smartphones 
collect about their users’ movement patterns or communication 
activity, which could provide a potential window into mental 
health. “Your smartphone is really this interesting diary of your 
life” says Anmol Madan, the co-founder and chief executive of 
Ginger.io, a digital mental-health company based in San Fran- 
cisco, California. Studies have now shown that certain patterns of 
smartphone use can predict changes in mental-health symptoms’; 
a drop in the frequency of outgoing text messages, for instance, 
may suggest that a user's depression is worsening. 

The Ginger.io app, which is still in beta, monitors these sorts 
of patterns and alerts each user’s assigned mental-health coach if 
it detects a worrying change. 


ABSENT EVIDENCE 

The evidence supporting the use of such apps is building*’. But this 
is a science in its infancy. Much of the research has been limited to 
pilot studies, and randomized trials tend to be small and unrepli- 
cated. Many studies have been conducted by the apps’ own devel- 
opers, rather than by independent researchers. Placebo-controlled 
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trials are rare, raising the possibility that a ‘digital placebo effect’ 
may explain some of the positive outcomes that researchers have 
documented, says Torous. “We know that people have very strong 
relationships with their smartphones,’ and receiving messages and 
advice through a familiar, personal device may be enough to make 
some people feel better, he explains. 

But the bare fact is that most apps haven't been tested at all. A 
2013 review* identified more than 1,500 depression-related apps 
in commercial app stores but just 32 published research papers 
on the subject. In another study published that year’, Australian 
researchers applied even more stringent criteria, searching the 
scientific literature for papers that assessed how commercially 
available apps affected mental-health symptoms or disorders. 
They found eight papers on five different apps. 

The same year, the NHS launched a library of “safe and trusted” 
health apps that included 14 devoted to treating depression or 
anxiety. But when two researchers tooka close look at these apps 
last year, they found that only 4 of the 14 provided any evidence 
to support their claims"’. Simon Leigh, a health economist at Life- 
code Solutions in Liverpool, UK, who conducted the analysis, 
says he wasn't shocked by the finding because efficacy research 
is costly and may mean that app developers have less to spend on 
marketing their products. 

A separate analysis” found that 35 of the mobile health apps 
originally listed by the NHS transmitted identifying informa- 
tion — such as e-mail addresses, names and birthdates — about 
users over the Internet, and two-thirds of these did not encrypt 
the data. 

Last year, the NHS took this apps library offline and posted a 
smaller collection of recommended online mental-health services. 
The NHS did not respond to e-mailed questions or make an official 
available for interview, but it did provide this statement: “We are 
working to upgrade the Health Apps Library, which was launched 
as a pilot site in 2013 to review and recommend apps against a 
defined set of criteria which included data protection” 

The regulation of mental-health apps is opaque. Some apps 
designed to be used in a medical context can be considered med- 
ical devices and therefore may be regulated by the UK Medicines 
and Healthcare Products Regulatory Agency, the US Food and 
Drug Administration (FDA) or equivalent bodies elsewhere. 
But the lines are fuzzy. In general, an app that claims to prevent, 
diagnose or treat a specific disease is likely to be considered a 
medical device and to attract regulatory scrutiny, whereas one 
that promises to ‘boost mood or provide ‘coaching’ might not. 
The FDA has said that it will regulate only those health apps 
that present the highest risks to patients if they work 
improperly; even mental-health apps that qualify as medical 
devices might not be regulated if the agency deems them to be 
relatively low risk. 

But the potential risks are not well understood. “At the low end, 
people might waste their money or waste their time,” says Martin, 
“and at the higher end, especially with mental health, they might 
be actively harmful or giving dangerous advice or preventing peo- 
ple from going and getting proper treatment.” 

When a team of Australian researchers reviewed 82 commer- 
cially available smartphone apps for people with bipolar dis- 
order”, they found that some presented information that was 
“critically wrong”. One, called iBipolar, advised people in the 
middle of a manic episode to drink hard liquor to help them to 
sleep, and another, called What is Biopolar Disorder, suggested 
that bipolar disorder could be contagious. Neither app seems to 
be available any more. 

Martin says that in Europe, at least, apps tend to come in two 
varieties, those that are commercially developed and come with 
little supporting evidence or plans for evaluation, and those 
with academic or government backing that take a more rigorous 


2016 Macmillan Publishers Limited. All rights reserved 


SOURCES: WHO; REF. 1; GSMA; IMS INST. 


approach. The problem is that the former are generally more 
engaging for users and the latter take so long to make it to the 
market — if they even do — that they look out of date. “This is a 
generalization,” Martin says, “but it’s broadly true.” 


UNINTENDED CONSEQUENCES 

Even well-intentioned apps can produce unpredictable outcomes. 
Take Promillekoll, a smartphone app created by Sweden's govern- 
ment-owned liquor retailer, designed to help curb risky drinking. 
While out at a pub or a party, users enter each drink they consume 
and the app spits out an approximate blood-alcohol concentration. 

When Swedish researchers tested the app on college students, 
they found that men who were randomly assigned to use the app 
ended up drinking more frequently than before, although their total 
alcohol consumption did not increase. “We can only speculate that 
app users may have felt more confident that they could rely on the 
app to reduce negative effects of drinking and therefore felt able 
to drink more often,” the researchers wrote in their 2014 paper”. 

It’s also possible, the scientists say, that the app spurred male 
students to turn drinking into a game. “I think that these apps are 
kind of playthings,” says Anne Berman, a clinical psychologist 
at the Karolinska Institute in Stockholm and one of the study’s 
authors. There are other risks too. In early trials of ClinTouch, 
researchers found that the symptom-monitoring app actu- 
ally exacerbated symptoms for a small number of patients with 
psychotic disorders, says John Ainsworth at the University of 
Manchester, who helped to develop the app. “We need to very 
carefully manage the initial phases of somebody using this kind 
of technology and make sure they’re well monitored,” he says. 

Ina pilot trial published earlier this year, ten US veterans with 
PTSD were randomly assigned to use PTSD Coach on their own 
for eight weeks, while another ten used the app with the support 
and guidance of primary-care providers. At the end of the trial, 
seven of the ten patients using the app with support showed a 
reduction in PTSD symptoms, compared with just three of the 
patients who used the app on their own”. 

But if apps require medical supervision, that undermines the 
idea that they will serve as an easy and low-cost way to provide 
care to the masses. “People think there’s an app for everything,” 
says Helen Christensen, the director of the Black Dog Institute 
at the University of New South Wales in Sydney, Australia, who 
has developed and studied mental-health apps. “It’s actually 
about how we build systems around apps, so that people have 
health care” 

Distributing mental-health apps in the developing world 
presents further challenges. Although mobile technology is 


135 to 300. And the number of trials specifically focused on 
mental and behavioural health increased by 32%, according to a 
report by the IMS Institute for Health Informatics in Parsippany, 
New Jersey. 

One digital health company that has earned praise from 
experts is Big Health, co-founded by Colin Espie, a sleep sci- 
entist at the University of Oxford, UK, and entrepreneur Peter 
Hames. The London-based company’s first product is Sleepio, 
a digital treatment for insomnia that can be accessed online 
or as a smartphone app. The app teaches users a variety of 
evidence-based strategies for tackling insomnia, including 


“WE NEED TO VERY CAREFULLY 
MANAGE THE INITIAL PHASES 


OF SOMEBODY USING THIS 
TECHNOLOGY AND MAKE SURE 
THEY’RE WELL MONITORED.” 


techniques for managing anxious and intrusive thoughts, 
boosting relaxation, and establishing a sleep-friendly environment 
and routine. 

Before putting Sleepio to the test, Espie insisted on creating 
a placebo version of the app, which had the same look and feel 
as the real app, but led users through a set of sham visualiza- 
tion exercises with no known clinical benefits. In a randomized 
trial, published in 2012, Espie and his colleagues found that 
insomniacs using Sleepio reported greater gains in sleep effi- 
ciency — the percentage of time someone is asleep, out of the 
total time he or she spends in bed — and slightly larger improve- 
ments in daytime functioning than those using the placebo app”. 
In a follow-up 2014 paper’, they reported that Sleepio also 
reduced the racing, intrusive thoughts that can often interfere 
with sleep. 

The Sleepio team is currently recruiting participants for a large, 
international trial and has provided vouchers for the app to sev- 
eral groups of independent researchers so that patients who enrol 
in their studies can access Sleepio for free. 

“We think this is the way forward for digital health,’ says Espie. 
Mobile-phone-based treatments, he says, “should be tested and 
judged like any other intervention. We shouldnt treat people's 
health with any less respect because the treatment is coming 
through an app.” m SEE COMMENT P. 25 


spreading rapidly, there are many people who do not have — or 
cannot afford — smartphones or mobile Internet access. And the 
content of apps needs to be delivered in local languages and reflect 
local cultures. “The notion that you can take an intervention and 
just plop it down in a region where people might not even use the 
same terms for mental health as you're using is a little unrealistic,’ 
says Ben-Zeev. “What we might call ‘hearing voices’ in the United 
States might be something like ‘communicating with your elders’ 
in a different region, depending on what label people attach to 
that experience” 

At this point, the notion that apps can deliver quality health 
care in low-income regions remains largely theoretical. “This is 
generally where the mHealth field is,” says Natalie Leon, a scientist 
at the South African Medical Research Council in Cape Town. “It’s 
a promise of potential effectiveness.” 


GOOD PRACTICE 

To make good on that promise, apps will have to be tested. 
Between 2013 and 2015, the number of mobile-health 
trials registered on ClinicalTrials.gov more than doubled, from 


Emily Anthes is a freelance journalist in New York City. 
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Soundtrack of the Universe 


Sheila Rowan ponders a chronicle of the long road to gravitational- wave detection. 


ravitational waves were first detected 
ce 14 September 2015 — a landmark 
event predicted a century before by 
Albert Einstein as part of his general theory 
of relativity. Janna Levin opens her chroni- 
cle of the long and twisting journey to the 
moment of detection, Black Hole Blues and 
Other Songs from Outer Space, with a quote 
from Niccolé Machiavelli’s The Prince: 
“There is nothing more difficult to take in 
hand, more perilous to conduct, or more 
uncertain in its success, than to take the lead 
in the introduction of a new order of things.” 
Thus, words from a political treatise written 
in 1513 encapsulate the difficult birth of a 
new field: gravitational-wave astronomy. 
The songs that Levin describes are the 
‘chirp’ of almost unfathomably faint grav- 
itational-wave signals: the soundtrack to 
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our view of the skies. 
A signal arising froma 
collision between two 
black holes 400 mil- 
lion parsecs away 
was picked up by the 
US-led Advanced 
Laser Interferometer 
Gravitational-Wave 
Observatory (LIGO). 


V 


Black Hole Blues 


and Other Songs ; : 
from OuterSpace Levin, a theoreti- 
JANNA LEVIN cal cosmologist, tells 


Alfred Knopf: 2016. the story of LIGO 
through interviews 
with key protagonists, including physicists 
Rainer Weiss, Ronald Drever, Kip Thorne 
and Rochus Vogt. She explains in clear 
terms the scientific heart of this achieve- 
ment and the deep and personal fascination 
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that pursuing it has held for several genera- 
tions of scientists. She also captures the cost 
of getting to this point, both financial — this 
is big science in its truest sense — and, in 
many cases, personal. In different ways, we 
are led to see how, from an idea, a calcu- 
lation, a basic experiment in a laboratory, 
the scientists involved drove the field for- 
ward from the 1960s, individually and then 
together. 

This is not the first book on LIGO’s 
history. Sociologist of science Harry Collins, 
for instance, has written several, including 
Gravity’s Ghost and Big Dog (University of 
Chicago Press, 2013; see M. de Laet Nature 
501, 164-165; 2013). Nor is it likely to be 
the last: the plot is too compelling. Verify- 
ing Einstein’s prediction demanded preci- 
sion technology and engineering beyond 


ILLUSTRATIONS BY REBEKKA DUNLAP. 


Ly 


anything achieved before. Optical instru- 
ments kilometres long were created, capable 
of sensing changes in the position of mir- 
rors (induced by gravitational waves) of less 
than one ten-thousandth the diameter of a 
proton. In tandem arose a deep understand- 
ing of the astrophysical sources far out in the 
Universe that might be — and we now know 
are — sending gravitational signals detect- 
able on Earth. 

The work was, moreover, a tug of war 
between funders and colleagues with faith 
in the endeavour, and other parties who, 
for various reasons, felt that it did not war- 
rant support. Levin describes this disso- 
nance well; it is genuinely painful to read 
in places. 

Levin is illuminating on how the very 
different Drever, Weiss and Thorne came to 
take the detection of gravitational waves as 
their life’s work. She shows how individual 
efforts at the Massachusetts Institute of 
Technology in Cambridge (led by Weiss) 
and the California Institute of Technology in 
Pasadena (led by Drever and Thorne) came 
together in the 1980s through the realiza- 
tion that the detection of gravitational waves 
would be possible only through a combined 


effort. In fact, they prefaced their 1989 fund- 
ing proposal to the US National Science 
Foundation with the quote from The Prince 
that Levin uses. They were aware even then 


SCIENTISTS BRING 
THEIR 


TO THEIR WORK: 


SHAPES 


of the difficult road ahead, although not the 
specific obstacles that would litter it. 

For instance, as Levin shows, within a 
few years the project's leadership changed 
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in unforeseen ways. Weiss and Drever, both 
strong leaders with individual ideas on pro- 
cess, and both passionate about the science, 
could not agree on a shared vision — even 
with Thorne’s mediation. Decision-making 
became impossible, threatening to derail 
the project. Vogt was appointed to restore 
some measure of stability. 

Scientists bring their whole history to 
their work: who they are shapes what they 
do, how they behave as leaders and how they 
make the research happen. So it is salutary 
to read about Drever’s background and its 
part in his ingenuity and thrift. He produced 
a stream of remarkable scientific ideas, 
sometimes starting with limited means 
and materials. Levin describes Drever in 
Glasgow, UK, carrying out research on 
Lorentz invariance — testing fundamen- 
tal aspects of the theory of relativity using 
basic materials in his back garden. But 
such inherent creativity does not necessar- 
ily sit easily with project management or 
the shared responsibilities that are part of 
leading a large scientific team. 

Similarly, Levin describes how Vogt’s 
management style was shaped by a dislike 
of authority that stemmed from experiences 
in his native Germany in the 1940s. This, 
too, became problematic in the steward- 
ship of LIGO, which, among other qualities, 
demanded the ability to take the strain of 
significant external oversight and formality. 
Further changes in directorship followed, 
bringing the LIGO project to the stable, suc- 
cessful state in which it remains today. Levin 
touches on those only lightly. 

Despite the upheavals, ‘unreasonableness’ 
— that singular drive on which “all progress 
depends’, according to playwright George 
Bernard Shaw — was clearly essential to the 
forging of this new scientific field. Weiss, 
Drever and Vogt all had that crucial, extra- 
ordinary ability to persevere against prevail- 
ing forces to enable progress. 

In many passages in Black Hole Blues, 
it is clear that we are reading the result of 
splitting the white light of truth through 
the prism of memory. In focusing mostly 
on the captivating details of a few indi- 
viduals, the full picture is lost, the whole 
remains largely unsynthesized, and gaps at 
the joins sometimes show. Most noticeably, 
the details in Levin’s chronicle stop around 
20 years ago. We skip from the end of Vogt’s 
tenure as LIGO director in the mid-1990s 
to the point at which the advanced LIGO 
instruments are about to take data in 
2015. As interesting as Levin’s narrative is, 
it leaves an entire exciting book still to be 
written. m 


Sheila Rowan is director of the Institute for 
Gravitational Research at the University of 
Glasgow, UK. 

e-mail: sheila.rowan@glasgow.ac.uk 
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METEOROLOGY 


The brainstormers 


Alan Thorpe enjoys a hymn to some of the founders of 
the science and institutions of weather forecasting. 


community of meteorologists and atmos- 
pheric scientists that accurate forecasts of 
major weather systems can be made reliably 
up to about a week ahead (see P. Bauer et al. 
Nature 525, 47-55; 2015). Many researchers 
contributed to the revolution in weather sci- 
ences in the first half of the twentieth century, 
so it is perhaps invidious to single out a few. 
Science historian James Fleming focuses on 
three: Norwegian Vilhelm Bjerknes, Swede 
Carl-Gustaf Rossby and American Harry 
Wexler. The first two I expected; the third I 
was intrigued to learn more about. Fleming 
devotes about 60 pages to each man’s life and 
work, and mentions many others and their 


NEWIN 
PAPERBACK 


I is thanks to the efforts of an international 


Highlights of this 
season’s releases 
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impacts, mostly on US weather forecasting. 
To the cognoscenti, the essentials of 
Bjerknes’s and Rossby’s science will be famil- 
iar. Fleming’ fascinating account clarifies why 
these two were giants of leadership. Bjerknes 
created the ‘Bergen school’ of meteorology, 
which used rigorous scientific principles 
to understand and predict the evolution of 
weather features such as fronts and cyclones. 
The school included his son Jacob among 
many talented, mainly Scandinavian, scien- 
tists. Rossby established university schools of 
meteorology in Stockholm and Chicago, and 
at the Massachusetts Institute of Technology 
in Cambridge. The breadth of Wexler’s role 
emerges through his contributions to the 
development of techniques and operational 
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weather forecasting, 
particularly in the 
United States. 
Between the birth 
of Bjerknes — the 
oldest — in 1862 and 
the death of Wexler, 
the youngest, in 1962, 
there passed a forma- 


tive and innovative [nventing — 
: Atmospheric 
century. As Fleming  ccience: 


reveals, their lives were 
linked, with Bjerknes 


Bjerknes, Rossby, 
Wexler, and the 


teaching Rossby and Foundations 
Rossby, Wexler. . cet 
a eteorology 
In 1904, in “Weather daiesenees 
forecasting asa prob- 5) -\jinic 


lem in mechanics and 
physics’ Bjerknes set 
the agenda for applying the laws of physics 
to the atmosphere to predict the weather 
(V. Bjerknes Meteorol. Z. 21, 1-7; 1904). 
His vision was to use a sufficiently accurate 
knowledge of the state of the atmosphere and 
the laws that govern its evolution to forewarn 
people about weather to come. His motiva- 
tion was to make his mark in what was for 
him a new field of science — he began his 
career working with his father, a physicist at 
the University of Oslo, on fluid analogies for 
the electromagnetic field. He was eager, too, 
to provide practical advice on hazards that 
affected mariners, farmers and the public. 

Fleming notes the absence of a book-length 
biography of Rossby, and I hope that this will 
be rectified soon. To me, he is a first among 
equals. As well as building institutions, he 
established important principles, such as the 
conservation of potential vorticity — used to 
understand the development of rotation in 
cyclones and other weather systems — and 
the large-scale atmospheric wave patterns 
named after him. He was a leader in devel- 
oping techniques such as experiments that 
simulate the atmosphere ina rotating tank of 
water, as well as aircraft soundings and the use 
of radiosondes, or radio-based measurements 
using weather balloons. A polymath with a 
high public profile, he was pictured on the 
cover of Time magazine in December 1956, 
with the title “Weatherman”. 

Wexler’s contributions include making 
the first research flight into a hurricane, 
using radar to track storm systems, working 


MIT Press: 2016. 


Climate Shock: The Economic Consequences of a Hotter Planet 

Gernot Wagner and Martin L. Weitzman (Princeton University Press, 2016) 

Global, long-term, irreversible, uncertain: four words used by economists Gernot 
Wagner and Martin Weitzman in their timely warning on the impacts of climate 
change. It is not enough to simply hope that we are wrong about worst-case scenarios; 
the authors’ most extreme predictions include a 20-metre sea-level rise and average 
global temperatures reaching 6°C above pre-industrial levels. Wagner and Weitzman 
urge us to act now to insure Earth against uncertainty. 


towards space observations and develop- 
ing the use of computers in meteorology. 

From the findings of these three men, 
Fleming expertly weaves a tapestry of 
broader developments, from early uses 
of computers and satellites to numeri- 
cal predictions with supercomputers. He 
explores intentional weather modification, 
radioactive fallout, rocketry, air pollution 
and electromagnetism. For example, the 
air movements made apparent when 
researchers tracked the fallout from 
nuclear-bomb tests in the 1950s provided 
insight into atmospheric circulation. 

The penultimate chapter covers what 
Fleming calls the birth of atmospheric 
science in the late 1950s, coinciding with 
planning for the International Geophysical 
Year in 1957-58. In 1956, Rossby proposed 
enlarging the definition of meteorology 
to include elements such as atmospheric 
chemistry and relevant biological pro- 
cesses. Much of what we regard as con- 
temporary developments, such as the 
understanding of atmospheric composi- 
tion or geoengineering, were in the minds 
of 1950s researchers, Fleming points out. 

Modifying the reflectivity of the planet 
to avoid harmful climate change was dis- 
cussed by Wexler and others as early as 
1958. In a 1962 speech, Wexler said: “We 
are in weather control now whether we 
know it or not.’ Fleming also focuses on 
the importance of committees in plan- 
ning developments such as the creation of 
the US National Center for Atmospheric 
Research in 1960, and the role of top 
researchers in leading these committees. 

A historical account has to have bound- 
aries, and Fleming barely hints at what 
came after 1960. I found the discussion of 
Edward Lorenz’s work on chaos that led to 
aspects of probabilistic forecasting dispro- 
portionately brief. 

What shines through Inventing Atmos- 
pheric Science is the commitment of three 
men to applications of research to society, 
and their desire to advance our under- 
standing of weather. This is an inspira- 
tional story, very well told. m 


Alan Thorpe is visiting professor of 
meteorology at the University of Reading, UK. 
e-mail: alan.thorpe@gmx.com 
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Leonard Mlodinow (Vintage, 2016) 
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Oilman at the peak 


Gregor Macdonald applauds a biography of prescient 
geologist and energy theorist Marion King Hubbert. 


scientist’s work does not always 
An neatly with the events of 

their time, but that was the good 
fortune of US geologist and oilman Marion 
King Hubbert (1903-89). After labouring 
for decades to perfect forecasting of the oil- 
production limit, he saw his efforts validated 
in the energy crises of the early 1970s. His 
approach would later be known as Hubbert 
peak theory. 

Journalist Mason Inman’s meticulous 
biography The Oracle of Oil follows Hub- 
bert from his youth on the plains of Texas 
through the Great Depression, the Second 
World War and the rise of US President 
Ronald Reagan in 1981. But its scope is 
much more expansive. In Hubbert’s story, 
Inman has found a meditation on the 
booms and busts that marked 
twentieth-century economic 
growth. Hubbert’s icono- 
clastic career forms a /¢ 
perfect arc, from oil's 
troubling oversupply 
in the 1920s to its rela- 
tive scarcity after the 
peak of US produc- 
tion in 1971, when the 
US economy suffered 
oil shocks. 

As Inman shows, Hub- 
bert’s impact extends beyond 
oil: it is an early manifestation of 
ecological economics. At the end of 
his career, Hubbert remained concerned 
about nuclear waste; was convinced that 
high rates of growth were environmentally 
destructive; and conjectured that solar 
power might be the most viable energy 
solution. With regard to growth or sustain- 
ability, Hubbert’s work is an overlooked 
contribution to US economic history. 

Arriving at the University of Chicago 
in Illinois in 1924 with little money but no 


GP \ . 


Carrier pigeons once toted stock prices. Today, 
instant messaging manages the job. Theoretical 
physicist Leonard Mlodinow explores how the 
most human of desires, a thirst for knowledge, 
grew from Neanderthal hunger pangs to 
measuring our planet’s orbit around the Sun. 
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The Oracle of Oil: A 
Maverick Geologist’s 
Quest for a Sustainable 
Future 

MASON INMAN 

W. W. Norton: 2016. 


shortage of ambition, Hubbert saw geology as 
a wide-open field, and gate-crashed it. With 
precocious brilliance, he began to identify 
gaps and disorganization in its practices, and 
published his first paper, on fault classifica- 
tions, as an undergraduate. In 1930, he was 
head-hunted by Columbia University in New 
York City to direct a new effort in geophysics. 
Here, Inman's biography reveals its 
thoughtful design, tracing the 
youthful roots of Hubbert’s 
formidable achievements. 
The undergrad ponder- 
ing Earth’s folds one 
summer later becomes a 
breakthrough geologist, 
solving complex scal- 
ing problems. The boy 
surrounded by oil rigs 
\f becomes a Shell execu- 
tive, aggressively pursuing 
the hydrological and struc- 
tural complexities of oilfield 
exploration. The young man who 
questioned religious faith becomes a 
nonconformist US Geological Survey (USGS) 
researcher, insisting that popular forecasts are 
built less on data than on optimism. 

Inman’s direct, explanatory style is well 
suited to describing the evolution of Hub- 
bert’s thinking. At Columbia, Hubbert began 
to ponder the S-curve of growth that has long 
fascinated observers of economies, biology 
and natural-resource extraction. He perfected 
his technique over decades, from rough > 


Why Information Grows 

César A. Hidalgo (Penguin, 2016) 

Economies are computers and information is at 
war with entropy, claims statistical physicist César 
Hidalgo. He shows how the scientific imagination 
needs knowledge and resources to grow, such as 
the Chilean copper that ‘feeds’ electronics (see 
Philip Ball’s review: Nature 521, 420-421; 2015). 
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> estimates of ultimately recoverable US oil 
reserves to his eventual winning model: an 
advanced calculation that incorporated past 
production, yield per foot of exploration and 
the tricky variable of reserve growth. 

Inman does not, however, cite the work of 
UK economist William Stanley Jevons, whose 
1865 warning about the economy’s over- 
reliance on coal prefigures the Hubbert story. 
Jevons died in 1882, so never saw his predic- 
tion come true: British coal output peaked in 
1913. Hubbert, by contrast, was feted with 
numerous awards, including the Rockefeller 
Service Award in 1977, and broad coverage 
in The New York Times when his previous 
reports for the government and the USGS 
were acknowledged for their accuracy. 

Hubbert’s forecast was not the end of the 
US oil story. After his death, production con- 
tinued to languish, in accordance with his 
forecast. But with fracking, the United States 
lifted oil production as recently as last year 
to levels close to the 1970 peak. Oil produc- 
tion is now falling again owing to a price bust 
— global supply capabilities were created for 
demand that failed to materialize. Inman does 
a fine job of handling this recent history. 

The Oracle of Oil offers valuable insights 
beyond energy. In the demand-side bust of 
the 1930s, it shows Hubbert thinking deeply 
about the surplus of labour created partly, in 
his view, by the effects of powerful oil mar- 
ried to the newest machines: cars, construc- 
tion equipment and aircraft. Hubbert was 
co-founder of Technocracy, a group of New 
York intellectuals aiming to prevent future 
economic dislocations. Two publications by 
keen observers of the low-growth problem — 
Thomas Piketty’s Capital in the Twenty-First 
Century (Seuil, 2013) and Robert Gordon's 
2012 paper ‘Is US Economic Growth Over?’ 
(see go.nature.com/wblxig) — also explore 
this territory of limits and sustainability. 

In Inman’s work, the oilman emerges as a 
restless and prescient figure concerned with 
the environment. In writing the first biogra- 
phy of Hubbert, Inman has retrieved, if not 
rescued, the story of a scientist who has much 
to offer to today’s energy conundrum. = 


Gregor Macdonald is a journalist based 
in Portland, Oregon, who covers the energy 
sector. 

e-mail: gregor@gregor.us 
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Listening in 
on yourself 


Douwe Draaisma is intrigued by a study examining both 
‘the voice within’ and verbal auditory hallucinations. 


hinking about thinking is a curious 
[Tees Most of us probably agree 

that much of our own thought pro- 
cess takes the form of inner speech. But 
would we also agree that we hear an inner 
voice? If not, why would we call it talk- 
ing at all? Can we experience and observe 
inner speech simultaneously, or would this 
be like “trying to turn up the gas quickly 
enough to see how the darkness looks’, 
as US psychologist William James asked 
himself in 1890? Each of these questions 
may lead one into a philosophical forest, 
dense and dark. 

Side-stepping such conceptual intrica- 
cies, psychologist Charles Fernyhough 
convincingly explores 
inner speech from a 
practical perspective. 
In The Voices Within, 
he discusses how people 
with aphasia (a speech 
and language disorder 
that stems from brain 
damage) may lose their 
sense of inner speech; 
how deaf people ‘talk 
to themselves’ (mostly 
in sign language, some 
by lipreading); how more than 60% of 
children have had silent conversations with 
imaginary friends; and whether people 
who stutter experience their inner speech 
as fluent (they do). Silent self-talk, evasive 
as it is to introspection, turns out to bea 
robust and quintessential part of memory, 
thought and imagination. 

Fernyhough’s sources are equally wide- 
ranging. He draws on internal monologues 


Spirals in Time: The Secret Life and Curious e e : 
Afterlife of Seashells ie € 06 
Helen Scales (Bloomsbury, 2016) | 4 ey 
From beachcombing to shipwreck diving, marine | S E E DS 
biologist Helen Scales shares her love of molluscs, | ®@ @ 
many of which convert seawater into protective [@ey 
homes. Shells, she reveals, have served as e & ry rv } 
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SILENT SELF-TALK 
TURNS OUT TO BEA 


ROBUST 


PART OF 
MEMORY. 


The Voices Within: 
The History and 
Science of How We 
Talk to Ourselves 
CHARLES FERNYHOUGH 
Profile: 2016. 


/ 


Bovary and reports of self-talk by profes- 
sional cricketers. He mentions physicist 
Richard Feynman having an argument 
with himself, and Joan of Arc insisting 
that God talked to her in French, not Latin. 
Today, Fernyhough directs Hearing the 
Voice, a research project 
at Durham University, 
UK, funded by biomedi- 
cal charity the Wellcome 
Trust. 


from experience that it is 
wise to switch my inner 
speech from Dutch, my 
native language, to Eng- 
lish, which I routinely use 
for scientific communi- 
cation. Most bilingual people have no trou- 
ble identifying the language that they are 
thinking in. But I would be hard-pressed 
to say whether I talk to myself at a natural 
speed or in an abbreviated way, much less 
whether the stream of my thoughts flows 
equally fast (or slow) in both languages. 
Most people say that they have the definite 
impression that their inner speech unfolds 
faster than actual speech. 


The Triumph of Seeds 

Thor Hanson (Basic, 2016) 

Biologist Thor Hanson sows the ultimate 
celebration of seeds and how they conquered 
Earth. Kernels can be crafty: unripe fruit, for 
instance, tastes bitter to deter predators from 
dispersing the seeds too soon (see Sandra 
Knapp’s review: Nature 519, 288-289; 2015). 


in Gustave Flaubert’s 1856 novel Madame 


When I’m invited to 
write a review, I know 


Referring to the ideas of philosopher Eric 
Schwitzgebel, Fernyhough points out that 
this leaves several questions open. Is inter- 
nal speech faster because it is unhampered 
by slower motor processes, or because we 
don’t think in full sentences? According to 
twentieth-century Russian psychologist Lev 
Vygotsky, inner speech develops as children’s 
conversations ‘go underground, often as a 
result of social encouragement. In the process, 
they begin to thinkin ‘pure meanings, causing 
telegraphic inner speech. This condensation, 
in turn, could help to explain the paradoxi- 
cal sensation that inner speech feels faster, 
but never rushed. Sped up or condensed? To 
this day, there are no consensual methods or 
techniques to decide between the two. 

Fernyhough also presents several inter- 
views with participants in his Durham 
research project, itself an offshoot of the 
Hearing Voices Movement founded in 1987 
by Dutch psychiatrist Marius Romme and 
now expanded to 23 countries. At present, 
the United Kingdom has 180 support groups 
for voice-hearers. Some people report hear- 
ing intrusive voices, often whispering or 


Infested: How the Bed Bug Infiltrated Our 
Bedrooms and Took Over the World 

Brooke Borel (University of Chicago Press, 2016) 

Bed bugs (Cimex /ectularius) are perfectly adapted 
for bloodsucking. Toothed mandibles pierce their 
victims’ skin and inject saliva proteins that widen the 
blood vessels and prevent clotting, reveals Brooke 
Borel in her creepy exposé of the household pest. 


shouting abuse, and seek relief by joining 
a local group. Most people who experience 
verbal auditory hallucinations have had a 
diagnosis of schizophrenia or have expe- 
rienced childhood abuse — but, as Ferny- 
hough points out, not all. Clinically relevant 
as they are, these chapters expose a tension 
in The Voices Within: it is still a matter of 
controversy whether hearing voices has 
much to do with the quiet self-talk of ordi- 
nary thinking. 

Fernyhough tries to bridge this gap with a 
model that he dubs dialogic thinking, which 
conceptualizes inner speech as an internal- 
ized conversation between different voices. 
He hypothesizes that if a patient fails to 
identify a particular utterance as a fragment 
of some inner dispute, he might experience 
this fragment as coming from an external 
source — a hallucination. But many voice- 
hearers also experience inner speech, and can 
distinguish between the two. It remains to be 
seen whether the experience of hearing voices 
will really offer a window on inner speech. 

We have come a long way from US psychol- 
ogist John Watson's behaviourist speculation 
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that inner speech is simply covert motor 
action in the speech apparatus. New methods 
have invited new distinctions. Neuroimaging 
studies by Fernyhough and his colleagues 
suggest that speaking internally when 
instructed activates Broca’s area in the brain 
(associated with producing speech), and de- 
activates Heschl's gyrus (associated with audi- 
tory perception). Spontaneous, free-flowing 
inner speech, on the other hand, involved 
the opposite pattern of neural activation. 
This is but one of many promising avenues 
of research, and The Voices Within is full of 
them. Profound and eloquent (he is a novelist 
too), Fernyhough presents an intriguing array 
of fresh findings and perspectives. He makes 
a persuasive case that one of the most inti- 
mate and private of our mental activities has 
a social origin. We talk to ourselves because 
we talked to others first. = 


Douwe Draaisma is professor of the history 
of psychology at the University of Groningen 
in the Netherlands. His latest book is 
Forgetting. 


e-mail: d.draaisma@rug.nl 


Life’s Greatest Secret: The Race to Crack the 
Genetic Code 

Matthew Cobb (Profile, 2016) 

Anecdotes abound in zoologist Matthew Cobb’s 
history of the quest to unravel the genetic code. 
Cobb updates the story with a look at gene- 
editing tool CRISPR and its role in gene therapy, 
agriculture and the control of invasive species. 
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The women who launched NASA 


Jennifer Light savours the history of a doughty band of ‘human computers’. 


ata is having its moment in the 
D sun. It used to be an incidental 

detail in stories about person- 
alities and institutions in the history 
of science and engineering, politics 
and culture. Now, the construction of 
databases and details of data analysis 
are featured events. Even the 2015 film 
Spotlight, which dramatizes a journal- 
istic investigation into sexual abuse in 
the Catholic Church, has an extended 
scene in which the protagonists com- 
pile a spreadsheet. Number crunchers, 
once stereotyped as missing the bigger 
picture, now are the big picture. 

Natalia Holt’s Rise of the Rocket Girls 
reveals how, from the 1940s, a group of 
‘crunchers’ operated in near secrecy at 
NASA — their anonymity as much to 
do with their gender as with the status 
of data. These women staffed NASA's 
Jet Propulsion Laboratory (JPL) in 
Pasadena, California, as human com- 
puters and, later, programmers. Most 
held degrees in mathematics or science. 
Their work, Holt reveals, was innovative, 
yet was considered less important than 
other JPL outputs. Holt documents the 
work of previously unknown figures 
such as Barbara Canright, Susan Finley 
and Helen Ling, who supplied crucial 
calculations that helped to launch mis- 
siles, bombers and the first US satellite; 
to control lunar missions; and even to 
navigate today’s Mars rovers. Intercut 
with the human stories, Holt carefully lays 
out practical problems — such as the need to 
minimize fuel weight while ensuring that a 
rocket could reach escape velocity — and how 
mathematics helped to solved them. 

Here, maths is dramatic, not mundane. 
Calculating is a physical, even athletic, act. 
Holt describes Canright’s experience: “Her 
right index finger was lined with thick red 
and white calluses, the result of clutching a 
pencil for hours a day. Her grip on the pencil 
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Why Did the Chicken Cross the World? 

Andrew Lawler (Atria, 2016) 

With 20 billion chickens roaming Earth at any one 
time, this beleaguered bird truly merits Andrew 
Lawler’s illuminating eulogy. It shows how the red 
junglefowl (Gallus gallus) strutting in a Borneo 
jungle became an economic mainstay (see Ewen 
Callaway’s review: Nature 515, 490-491; 2014). 


often made her hand perspire, leaving pucker 
marks across the graph paper.” The women 
held “computing races” to solve complex 
mathematical problems in their down time. 
But as Holt shows, ‘doing the maths’ was often 
a matter of life and death. She describes Marie 
Crowley's work on the ideal size and shape of 
a rocket’s nozzle opening: the faster a nozzle 
can clear exhaust, the more thrust the rocket 
generates. On one occasion, as engineers were 
about to run a test based on her calculations, 
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Rise of the Rocket 
Girls: The Women 
Who Propelled Us, 
from Missiles to 
the Moon to Mars 
NATHALIA HOLT 

Little, Brown: 2016. 
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Crowley realized that she had forgotten 
to take a square root. Her phone call 
came too late and the rocket failed — 
luckily, no one was hurt. Often rushed 
and never double-checked in those seat- 
of-the-pants days, the calculations were 
a source of anxiety for the women. 

The drama of data is not the book’s 
main rationale. Holt’s larger interest is 
that it was women who did the calcu- 
lating. The book is organized chrono- 
logically, with photographs of starring 
figures at the start of each section. In 
keeping with larger currents in historical 
studies of databases and data analysis, 
Holt argues that these women's calcula- 
tions played an under-appreciated part 
in NASAs towering achievements. She 
bases her chronicle on interviews with 
several JPL data crunchers, members of 
their families and lab engineers, as well 
as a reading of scholarly and archival lit- 
erature on the space programme. 

The book does not show equal mas- 
tery of research on gender and labour 
in technical fields: Holt provides more 
data than analysis. She depicts the human 
computers life stories vividly, but her reflec- 
tions on larger themes are limited. One is the 
relationship between people and machines. 
For example, Holt mentions the women’s 
limited use of calculation aids. They often 
found it faster to do hand calculations than 
to work with a Friden calculating machine. 
Even in the late 1950s, when JPL acquired an 
IBM 704 — an early mass-produced com- 
puter — the crunchers (and engineers) were 


Most Wanted Particle 

Jon Butterworth (The Experiment, 2016) 
Experimental physics becomes accessible as 
well as astounding in this insider’s account 

of the hunt for the Higgs boson, detected in 
2012. Explaining fermions, jet algorithms and 
particle-accelerator malfunctions with ease, Jon 
Butterworth puts a lively spin on atomic science. 


wary of machines “that had too many glitches 
to be trustworthy”. The speed and accuracy 
of later models threatened to make the math- 
ematicians roles redundant, but the women 
staved off obsolescence for some time by 
redefining their official jobs as programmers. 
Holt misses an opportunity here to make the 
connection between the experiences of JPL’s 
human computers in relation to the rise of 
automation, and today’s widespread anxieties 
over the ‘age of artificial intelligence. 

Personal anecdotes include details of the 
family arrangements that the women had to 
make in an era when marriage and children 
meant leaving the workforce. These will fas- 
cinate general readers and provide valuable 
primary source materials for future academ- 
ics. Yet Holt does not contextualize the JPL 
crunchers’ experience in the broader history 
of women in science and technology. She does 
observe that the proportion of JPL technical 
jobs held by women — 15% in the 1990s 
— was partly due to women organizing the 
hiring process. Yet although the figure was 
higher than elsewhere in NASA, it was still 
remarkably low. And in the same way that 
NASA failed to celebrate these women’s con- 
tributions, it also overlooked the evidence in 
favour of including women in the astronaut 
corps. As space curator Margaret Weitekamp 
has detailed in Right Stuff, Wrong Sex (Johns 
Hopkins University Press, 2004), the benefits 
of sending women to space included their 
small size compared to men, which meant 
that they would need less food and water. 

Engagement with such studies would have 
helped Holt to analyse how typical the JPL 
programmers experience was for women in 
technical fields, and to explicate her original 
findings more clearly. Many scholarly studies 
describe the exodus of women at other insti- 
tutions from programming by the 1950s. By 
contrast, Holt notes, some of the early female 
crunchers stayed at JPL well into the twenty- 
first century, some in managerial roles. That 
raises questions about what made JPL differ- 
ent and, crucially, about retaining women in 
technical fields today. That story remains to 
be written. m 


Jennifer Light is professor of science, 
technology and society at the Massachusetts 
Institute of Technology in Cambridge. 
e-mail: jslight@mit.edu 
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Coitus defunctus 


Lori Andrews assesses Henry Greely’s treatise on how 
technology will oust reproductive intimacy. 


everything from magic to medicine 

to influence the traits of their chil- 
dren. Recommendations from relatives 
and advice columns have, across time, 
included: put a knife under your bed if 
you want a boy; eat sweets while pregnant 
if you want a girl; place headphones on 
your pregnant belly and play Jacopo Peri’s 
Euridice to ensure that your future child 
will appreciate opera. 

Current genetic technologies offer 
parents more-precise means of predict- 
ing and perhaps shaping the traits of 
their children. Embryos created through 
in vitro fertilization (IVF) can be analysed 
using pre-implantation genetic diagnosis 
(PGD); their entire genomes can even be 
sequenced. And the gene-editing tech- 
nique CRISPR-Cas9 
is expected to one day 
offer couples a chance to 
repair and enhance the 
genes of their embryos. 

In The End of Sex and 
the Future of Human 
Reproduction, lawyer 
and bioethicist Henry 
Greely does an enviable 
job of explaining the sci- 
entific underpinnings 
and legal regulation 
of current reproduc- 
tive and genetic technologies. The central 
focus of his book is his prediction that a new 
technology will develop — one that he dubs 
“Easy PGD” Greely envisions a situation in 
which a woman will not have to undergo 
treatment with hormones and have her eggs 
removed to produce an embryo for testing 
— as is done in the course of IVE Instead, 
some of her skin cells will be removed and 
coaxed by stem-cell technologies to turn 


iz or millennia, people have been trying 


~<— Birth of a Theorem: A Mathematical Adventure ie as 


Cédric Villani (Vintage, 2016) 
Tackling the 140-year-old Boltzmann equation | 


GFforic (with Clement Mouhot) led mathematician Cédric HUMaNy IND 

4 LAN} | Villani to win a share in the 2010 Fields Medal. | ; 
a Documenting this quest, Villani encapsulates the |Ataror —— 
ns * | despair and elation that maths can incite (see Amir , * WaRcoun, | 
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THE PROCESS OF 
DECISION- 
MAKING 


WOULD BE 


DAUNTING. 


The End of Sex and 
the Future of Human 
Reproduction 

HENRY T. GREELY 
Harvard University Press: 
2016. 


V 


into eggs. The eggs will then be fertilized by 
sperm from her partner to create as many 
as 100 embryos. In this scenario, parents 
will be able to choose which embryos to 
have implanted in the woman's uterus on 
the basis of hundreds of traits (or more) 
revealed by whole-genome sequencing. 

Greely makes three claims about this 
putative Easy PGD. First, it will replace sex 
as a way to create babies. Second, it will be 
more socially acceptable 
than current PGD and 
prenatal genetic-testing 
technologies. And third, 
it will be free to the user. 

All three claims are 
problematic. 

Greely assumes that 
people will elect to 
create children with 
Easy PGD rather than 
through sex because 
of the desire to control 
the traits of their chil- 
dren. But if people were so keen to choose 
embryos on the basis of their genotypes, 
all couples who use IVF would submit 
their embryos to genetic testing (given 
that this would involve no extra risk to the 
woman). In the United States, only 5% of 
such couples do so. 

Greely also speculates that couples will 
rush to use stem cells to create embryos 
because such an approach would beless > 


Humankind: How Biology and Geography 
Shape Human Diversity 

Alexander Harcourt (Pegasus, 2016) 
Biogeographer Alexander Harcourt ponders the 
myriad forces that led to the amazing diversity 
of Homo sapiens as we spread across the globe. 
He thinks that coastal migration once prevailed: 
harvesting seafood was easier than hunting. 
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> invasive than IVE But there is no proof 
that the risks to women presented by IVF 
(such as ovarian hyperstimulation syndrome 
and pelvic infection) are a substantial deter- 
rent when it comes to couples who use the 
procedure: globally, more than 400,000 chil- 
dren are born through IVF each year. Nor 
is there any evidence that couples would 
embrace an untested stem-cell procedure, 
with unknown risks to the resulting child. 
Even if Easy PGD were risk-free, the 
process of decision-making would be daunt- 
ing. Whole-genome sequencing would 


Digital Capitalism 
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Pressed for Time: The Acceleration of Life in 


Judy Wajcman (University of Chicago Press, 2016) 
Sociologist Judy Wajcman sagely analyses the 
disparate experience of time as technology has 
evolved. Despite the common belief that smartphones 
heighten stress, she argues that we are not victims of 
machines, but masters of their role in our lives. 


provide prospective parents with hundreds 
or thousands of bits of information about 
each embryo. The BRCA1 tumour-suppressor 
gene alone consists of more than 81,000 base 
pairs. Of the hundreds of BRCA1 mutations 
that have been reported, some are associated 
with an increased risk of developing breast 
cancer or ovarian cancer; others are not, and 
the significance of many is not yet known. 

If the test reports a mutation of unknown 
significance in any of an embryo’s 
20,000 genes, parents might end up dis- 
carding a perfectly healthy embryo, or — if 
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they have the embryo implanted anyway — 
worrying throughout the child’s life about the 
potential manifestation of a horrible condi- 
tion. Even the mutations that are associated 
with disease will be hard to make sense of. 
What does a person do with the knowledge 
that a certain embryo will develop into some- 
one who has a breast-cancer risk 30% higher 
than that of the general population, double 
the baseline risk of developing Alzheimer’s 
disease, and a 45% possibility of achieving a 
higher college admission score than another 
embryo with an equally complex data set? 

Greely claims that Easy PGD will be more 
socially acceptable than current genetic test- 
ing of embryos and fetuses because, unlike 
existing approaches, it does not involve abor- 
tion. Yet whether the embryos are derived 
from skin cells or seeded with gametes, 
right-to-life advocates will oppose the termi- 
nation of embryos that could be implanted 
to create children. Furthermore, many more 
embryos would be discarded using Greely’s 
proposed Easy PGD than are now destroyed 
in the course of IVF and PGD. 

Greely’s final claim is that Easy PGD will 
be free. Using assumptions, including that 
it will cost US$1,000 to create 100 embryos 
from skin cells, $60 per embryo for genetic 
diagnosis, and $500 for genetic counselling, 
he estimates that it will cost about $11,000 
for the entire procedure. According to 
Greely, at that cost, “the procedure should 
pay for itself for health reasons” But current 
genetic technologies also save future health 
costs, and these techniques are not always 
covered by health-care systems. In fact, it is 
extremely difficult for women below a cer- 
tain income bracket to access them. 

Greely imagines the slogans that clinics 
will use to tout Easy PGD: “You want the 
best for your child; why not have the best 
child you can?” But after exploring Greely’s 
claims in detail, Easy PGD seems more likely 
to be an expensive technology of unknown 
risks that would present parents with hard- 
to-interpret information about as many as 
100 embryos. The idea of Easy PGD should 
make us uneasy indeed. m 


Lori Andrews is distinguished professor of 
law at Chicago-Kent College of Law, Illinois 
Institute of Technology. 

e-mail: landrews@kentlaw.iit.edu 


The Cosmic Cocktail: Three Parts Dark Matter 
Katherine Freese (Princeton University Press, 2016) 
What is the Universe made of? Physicist Katherine 
Freese chronicles the cracking of this beguiling 
enigma, from the eccentric, ski-jumping Fritz 
Zwicky (who coined the term dark matter) to 
particle-smashing physics (see Francis Halzen’s 
review: Nature 509, 560-561; 2014). Emily Banham 
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Embed stormwater 
use in city planning 


Potable water resources are 
being depleted at an alarming 
rate worldwide. Storm water is a 
hugely under-utilized resource 
that could help as extreme 
weather events become more 
frequent. 

The challenges of collecting 
and using storm water mean that 
the practice is not widespread. 
Rainfall tends to be seasonal, 
so storm water must be stored 
for use in dry periods in natural 
underground aquifers (see 
A. Mankad et al. J. Clean. Prod. 
89, 214-223; 2015) or in specially 
built reservoirs. The reliance of 
such projects on the weather can 
make the costs hard to justify. 

Storm water may also be heavily 
polluted and is expensive to treat. 
This can make alternatives such 
as imposed water rationing or 
water transfers from other areas 
more attractive — despite their 
human and environmental costs. 
Stormwater treatment would be 
more economically viable if less- 
purified water were used for non- 
drinking purposes. This would 
require wider public education, 
because the idea of recycled water 
is anathema to many. 

For expanding cities, ways to 
deploy storm water will need 
to be embedded into urban 
planning. The technology is 
already available and could be 
tailored to different situations 
(see, for example, T. D. Fletcher 
et al. J. Environ. Qual. 37, S-116- 
S-127; 2008; A. E. Barbosa et al. 
Water Res. 46, 6787-6798; 2012). 
Shunsuke Managi Kyushu 
University, Fukuoka, Japan. 
Ashantha Goonetilleke, Clevo 
Wilson Queensland University of 
Technology, Brisbane, Australia. 
managi@doc.kyushu-u.ac.jp 


Engage engineers in 
soil management 


The UK government's 
Environmental Audit Committee 
is due to release a report on its 
enquiry into soil health by this 


summer. With most soil guidance 
for the United Kingdom coming 
from the European Union 
and the United Nations, the 
possibility of Britain leaving 
the EU (‘Brexit’) makes UK soil 
protection even more pressing. 
US President Franklin D. 
Roosevelt declared in 1937 that 
“the nation that destroys its 
soils destroys itself” (see also 
L. Montanarella Nature 528, 
32-33; 2015). Since agriculture 
became industrialized, we 
have been taking more organic 
matter out of soils than we 
put in. Together with urban 
development, flooding and only 
partial return of organic wastes 
to soil, this means that soil 
carbon is being depleted globally, 
not just in the United Kingdom. 
Engineers could lead the way 
in redressing this balance, as 
they have for recycling precious 
metals such as catalytic platinum 
(see, for example, C. Hageliiken 
Platin. Met. Rev. 56, 29-35; 
2012). For instance, they can 
stabilize carbon in organic 
wastes using minerals such as 
manganese oxides (K. Johnson 
et al. Nature Commun. 6, 
7628; 2015) and optimize soil 
amendments to maintain and 
enhance soil ecosystem services. 
Engineers, who are currently 
largely excluded from sustainable 
soil management, will then be 
working with the environment 
— instead of just on top of it. 
Karen L. Johnson Durham 
University, UK. 
karen.johnson@durham.ac.uk 


A global baseline for 
ecosystem recovery 


Indicators of success for 
ecosystem restoration depend 
on a proper assessment of the 
extent to which human activity 
has degraded biodiversity 

and ecosystem functions and 
services. In the run-up to the 
13th Conference of the Parties 
on the 2020 Aichi biodiversity 
targets this year in Cancun, 
Mexico, we suggest how such 
assessments could be improved 


and standardized globally. 

The current state of 
ecosystems is measured by 
using direct observations. To 
detect trends in degradation 
or recovery over time, we must 
also define a baseline reference 
state. However, an arbitrarily 
chosen baseline, such as 50 years 
ago, might not reflect the true 
magnitude or direction of land 
degradation or recovery. 

Instead, we suggest using an 
ecosystem’s pre-degradation 
state, also known as its natural 
state. This state has no human- 
caused loss of biodiversity or of 
ecosystem functions. Contrasted 
with the ecosystem’s current 
state, it provides a robust 
indicator of damage or recovery. 
It is determined by scientific 
research, but should not be 
viewed as a restoration target — 
those are merely political goals 
that balance social, economic 
and ecological interests. 

A pre-degradation baseline 
could provide a universally 
applicable reference state. It 
reflects the deviation from 
genuine sustainability and, as a 
global standard, would ensure 
the fairness and comparability 
of ecosystem assessments 
across countries that are in 
different phases of economic 
development. 

Janne S. Kotiaho University of 
Jyvaskyla, Finland. 

Ben ten Brink PBL-Netherlands 
Environmental Assessment 
Agency, The Netherlands. 

Jim Harris Cranfield University, 
Bedfordshire, UK. 
janne.kotiaho@jyu.fi 


Holistic hydropower 
scheme for China 


We propose three ways in which 

China could cut the human 

and environmental cost of its 

hydropower developments. 
First, reform the laws that 

govern new dam developments 

in China. These should include 

provisions to address the 

impact of individual projects 

on river basins. Measuring and 


understanding the cascade 
effects of dams and predicting 
ecosystem trends can guide 
decisions on fish propagation 
and optimal reservoir operation, 
for instance. Charges for 
hydroelectricity need to be 
rationalized and migrants’ 
compensation standards 
updated. And inter-regional 
transfer payments to the 
authorities, intended to balance 
the social and environmental 
losses of affected areas with 

the gains in other areas, should 
correlate with the economic 
benefits of hydropower 
development. 

Second, pay more attention 
to resettlement. Migrant 
populations could be relocated 
to mountainous areas along 
cascade reservoirs, which would 
boost infrastructure and preserve 
natural resources. People would 
need government help with land 
planning, education, employment 
and forming new social networks, 
without any devaluation of their 
indigenous knowledge, social 
capital or traditions. 

Third, improve the 
governance of international 
rivers. China and its neighbours 
should share river management 
and water resources, 
cooperatively seeking solutions 
to the problems of hydropower 
production, environmental 
sustainability and climate 
change. China has a wealth of 
experience and technologies 
to contribute from its many 
previous hydropower projects. 
Wenzhe Tang* Tsinghua 
University, Beijing, China. 
twz@mail.tsinghua.edu.cn 
*On behalf of 7 correspondents (see 
go.nature.com/h6xza4 for full list). 


CORRECTION 

An article in the Nature Index 
2016 Japan (Nature 531, 
$127-S130; 2016) wrongly 
described an alumina film 

as having high thermal 
conductivity; in fact, it has 
high thermal insulation. It also 
misnamed JEOL Ltd as JOEL. 
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STELLAR ASTROPHYSICS 


Supernovae in the neighbourhood 


Detailed measurements of radioisotopes in deep-sea deposits, plus modelling of how they reached Earth, indicate that 
many supernovae have occurred near enough to have potentially influenced evolution. SEE LETTERS P.69 & P.73 


ADRIAN L. MELOTT 


or more than half a century, 
f astronomers have speculated 

that supernovae have occurred 
close enough to Earth to affect the 
planet, possibly contributing to mass 
extinctions or climate change. In this 
issue, two studies of isotopes produced 
by supernovae reveal events thought 
to have happened a few million years 
ago: Wallner et al.' (page 69) report 
measurements of marine deposits that 
contain isotopes produced outside the 
Solar System, whereas Breitschwerdt 
et al.” (page 73) model the transport 
of iron-60 and use this to calculate the 
explosion times and sites of the likely 
sources. It is now possible to ask key 
questions with some precision; for 
example, could these supernovae have 
had substantial effects on Earths cli- 
mate and organisms — and perhaps 
even a role in human evolution? 

The past 500 million years or so have 
witnessed mass-extinction events of 
varying intensity. These include the 
major end-Cretaceous event that 
wiped out the dinosaurs and about half 
of the planet’s other species 66 million 
years ago, and a moderate event at the end of 
the Jurassic period 145 million years ago. In 
1954, it was suggested’ that a nearby super- 
nova caused the greatest mass-extinction 
event, which occurred about 250 million 
years ago (end-Permian) and affected more 
than 90% of species on Earth. For decades 
afterwards, progress in research centred on 
estimates of how often such stellar events 
might happen and what effects, if any, 
there might be on Earth and the interstellar 
medium. Unsurprisingly, these studies lacked 
precision. 

Breakthrough computations* in the 1990s 
showed that, for moderately close explosions 
— those occurring within about 100 par- 
secs (a few hundred light years) of Earth, 
expected every million years or so — radio- 
active nuclides can be transported and depos- 
ited on our planet. The isotopes are carried 
by dust grains that are only weakly affected 
by magnetic fields or gas pressure. Several 
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Figure 1 | Simulations of natural fireworks ona grand scale. 
Breitschwerdt et al.” performed high-resolution numerical 
simulations of supernovae that formed the Local Bubble (foreground, 
right) and the Loop I superbubble (left). Located in the Milky Way, 
these bubbles are large regions of hot, low-density gas surrounded 

by shells of swept-up debris that the Solar System has traversed for 
the past 5 million to 10 million years. In the image, the central solid 
green patch near Earth (represented by a white star) shows the mass 
distribution of the iron-60 isotope associated with the two bubbles 
2.3 million years ago. The authors’ simulations of the trajectory of 
iron-60 expelled from the supernovae agree with measurements 

of the isotope in the deep-sea crust by Wallner and colleagues’. 


candidate isotopes are problematic when 
used as evidence of supernovae: for example, 
beryllium-10 is also produced by radia- 
tion impinging on Earth’s atmosphere, and 
plutonium-244 measurements suffer from 
background contamination originating 
from the testing of nuclear weapons. Never- 
theless, concrete comparisons of estimates 
with measurements became possible for 
the first time. 

In 1999, the detection? of iron-60 in the 
deep-ocean ferromanganese crust broke the 
dam and led to a flood of results. Iron-60 is 
an excellent supernova indicator because it is 
produced abundantly in many types of super- 
nova; non-supernova channels would produce 
only up to one-tenth as much. Nonetheless, the 
detection would not have been possible with- 
out the atom-counting capability of accelerator 
mass spectrometry, which allowed researchers 
to separate different isotopes by mass. This was 
followed by the first detailed computation’ of 
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the effects on Earth’s atmosphere of 
photons and cosmic rays from super- 
nova sources. That study’ established 
a ‘kill distance’ of roughly 8 parsecs, 
within which the effects of these 
high-energy particles on terrestrial 
biota would be catastrophic. Super- 
novae this close should, on average, 
occur as often as once every 800 mil- 
lion years or so’, occasionally inter- 
spersed with additional events due 
to y-ray bursts. 

Breitschwerdt et al. looked at the 
deposition history of iron-60 and 
the trajectories of stars from a likely 
precursor group — the Local Bub- 
ble, a region of hot gas that includes 
both the Solar System and the prob- 
able sites of the supernovae that 
formed the bubble (Fig. 1). They 
find that the nearest explosions were 
approximately 100 parsecs away. 
The authors highlight the main 
sources of uncertainly in their con- 
clusions as being in the supernova 
yield of iron-60 and in its transport 
through the interstellar medium and 
deposition on Earth. Despite these 
uncertainties, their findings are an 
important proof of concept for a uni- 
fied picture of the dynamics of the interstellar 
medium, of the dynamics of the stellar group 
that gave rise to the supernova, and of the 
subsequent deposition of isotopes on Earth. 
The general set of parameters described in the 
study falls within a range suggested in recent 
work’ on analytical estimates of deposition 
processes. 

By using accelerator mass spectrometry on 
three different deep-sea archives, encompass- 
ing four sediment cores, two iron-manganese 
crusts and two iron-manganese nodules, 
Wallner et al. greatly expand the amount of 
available data for iron-60, as well as for alu- 
minium-26 and beryllium-10. They find 
iron-60 levels averaging at about 40 times 
background levels. These results favour super- 
novae as the source, in two main events: one 
at 1.7 million to 3.2 million years ago, and the 
other 6.5 million to 8.7 million years ago. Both 
events were longer in duration than could be 
explained by the passage of a single blast wave. 
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Wallner and co-workers therefore propose that 
either a series of supernova events directly 
affected the Solar System, or that the Solar 
System passed through an interstellar medium 
polluted by the mixed products of multiple 
supernovae. 

Either the direct or indirect picture is 
reasonable, given that the rate of supernovae 
is expected to vary significantly in the dif- 
ferent environments traversed by the Solar 
System during its orbit about the Galaxy. 
The data provided by Wallner et al. will allow 
considerable refinement of models such as 
those of Breitschwerdt and colleagues. It will 
also facilitate research into several puzzling 
characteristics of the cosmic-ray spectrum 
(cosmic rays are mostly accelerated protons) 
that can be explained’ if a supernova occurred 
roughly 2 million years ago at the same dis- 
tance from Earth (roughly 100 parsecs) as 
the supernovae detected by Breitschwerdt 
and colleagues. 

What can we discern about Earth's violent 
past from this broadly unified picture? The 
recorded supernovae occurred beyond the 
kill distance, and no major global mass extinc- 
tions””” coincide with them. However, during 
the suggested interval there was a general 
decline in temperature that culminated in 
the extensive series of glaciations in the 
Pleistocene epoch (from about 2.6 million to 
12,000 years ago), although we do not know 
if there is a link between supernova activ- 
ity and colder temperature. This climatic 
variation may be one of the conditions that 
led to human evolution. Ionization of the 
atmosphere by supernovae may also lead to 
an increase in lightning” and possibly other 
climatic effects. The new studies'” will open 
up unexplored avenues of modelling and 
detailed investigation, providing deeper 
insight into what might have happened on 
Earth over the past 10 million years as a result 
of nearby stellar fireworks. m 
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Fungus produces 
a toxic surprise 


A protein fragment released by filaments of the fungus Candida albicans destroys 
host cells. This is the first demonstration that human fungal pathogens other 
than moulds can release toxic peptides. SEE ARTICLE P.64 


AARON P. MITCHELL 


he fungus Candida albicans, a common 

cause of infection in mucosal tissues, 

forms long filaments called hyphae, 
comprised of tubular cells, that are required 
for virulence in animals’. Hyphal-associated 
adhesion proteins bind to host tissue, which is 
then degraded by hydrolytic enzymes. How- 
ever, until now, no hyphal toxin that dam- 
ages host cells has been identified. Because of 
this, the fungus is considered to be an ‘acci- 
dental’ pathogen that benignly inhabits our 
mucosal tissues, causing tissue damage by 
happenstance rather than design. That view 
must now change drastically. On page 64 of 
this issue, Moyes et al.” reveal a mechanism 
through which hyphae actively wage war on 
our cells — the production ofa toxin that the 
authors call Candidalysin. 

Interactions between C. albicans and epi- 
thelial cells, which line the body’s cavities, 
occur during the early stages of mucosal 
infections such as thrush and vaginitis. 
Hyphae elicit’ several epithelial-cell responses, 
including the production of signalling mol- 
ecules called cytokines that recruit cells of 


Hypha 


Hwpl 
1 Als3 


Epithelial cell 


the immune system to defend tissues, and 
loss of cell integrity through cell-membrane 
deterioration. Moyes et al. discovered that a 
strain of C. albicans in which the gene ECE1 
was mutated could not elicit epithelial-cell 
responses, despite growing apparently normal 
hyphae. Moreover, the authors validated these 
tissue-culture observations in vivo — the ECE1 
mutant was unable to reliably infect mucosa in 
a zebrafish swimbladder model and a mouse 
model of thrush. 

ECE1 was one of the first genes to be 
identified in hyphal-specific expression 
screens more than 20 years ago’, yet until now 
it has been one of the most poorly understood 
genes in C. albicans. In fact, ECE1 is among 
the most highly expressed genes in hyphae’, 
but its role has not previously been investi- 
gated thoroughly because mutants show no 
defects in hyphal morphology or cell pro- 
liferation*®. Thus, the function of the Ecel 
protein has remained a puzzle. 

How does Ecel promote epithelial-cell 
responses? The protein’s amino-acid sequence 
suggests that it is secreted from hyphae as a 
group of eight short protein fragments, or 
peptides, and so would be well positioned to 
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Figure 1 | A toxic relationship. The pathogenic fungus Candida albicans infects its host by forming 
filamentous structures called hyphae. Proteins on the hyphal surface — the adhesin Hwp1, the invasin 
Als3 and its partner Ssal — make contact with the host cell directly or through receptor proteins 

to promote adhesion and engulfment of the hypha by the host. Moyes et al.” report that the protein 
Ecel is secreted from the hypha as eight short peptides. One of these, Candidalysin, acts as a toxin 
that accumulates in the invasion pocket and attacks the host-cell membrane, leading to membrane 


permeabilization and the induction of host defences. 
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interact with host cells. Moyes and colleagues 
confirmed that all eight Ecel peptides are 
secreted from hyphae. Analysis of synthetic ver- 
sions of each peptide revealed that one, Ecel- 
III, elicits the same responses from epithelial 
cells as do hyphae. Moreover, precise deletion 
of the genetic region that codes for only Ecel- 
III created a mutant C. albicans that secreted 
the remaining seven peptides, but did not elicit 
epithelial-cell responses or cause mucosal dis- 
ease in animal models. These results clearly 
demonstrate that Ece1-III mediates the patho- 
genic activity associated with ECE]. 

By what mechanism does Ece1-II] exert 
this activity? Certain chemical and structural 
features indicate that Ece1-III might function 
like peptide toxins, such as the bee-venom 
toxin melittin. Indeed, the authors show that 
the peptide causes rapid and transient permea- 
bilization of artificial cell membranes in vitro. 
These activities are enhanced in the presence 
of cholesterol, a component of animal — but 
not fungal — membranes. The researchers 
conclude that Ece1-III acts as a peptide toxin, 
which they name Candidalysin. 

Moyes and colleagues’ study establishes that 
C. albicans hyphae evolved to damage host 
cells. When combined with our knowledge 
of hyphal adhesin proteins and enzymes, a 
simple program of tissue destruction emerges 
(Fig. 1). First, the hyphal-specific adhesin 
Hwp!1 attaches to mucosal surfaces’. Second, 
the hyphal-specific invasion protein Als3, act- 
ing with the protein Ssal, binds to receptors on 
the surface of the host cell, promoting engulf- 
ment of the hypha by the host cell®. Finally, 
Candidalysin accumulates in the invasion 
pocket around the hypha, attacking the host's 
cholesterol-containing membrane. 

This attack leads to membrane permeabili- 
zation, leakage of cell contents and a defensive 
cytokine response, which serves to limit the 
size of the C. albicans population in healthy 
individuals. However, impaired defences in 
people with conditions such as AIDS, diabetes 
and some cancers permit C. albicans growth 
and consequent disease. 

Every study raises fresh questions, and this 
one is no exception. For instance, why did a 
previous analysis® find that ECE1 was not 
necessary for invasive infection? The study in 
question was rigorous and efficiently tested 
many C. albicans genes using a mixture of 
mutant strains. One possible explanation, 
given that Ecel functions extracellularly, is that 
an ECE] mutant was rescued by neighbouring 
cells that did express the gene. 

Might other human pathogens of the genus 
Candida also produce peptide toxins? The ECE1 
gene is found in only a few other genomes. 
However, other aspects of ECE1 might serve as 
good guides in the hunt for prospective toxin 
genes. For example, a small, highly expressed 
gene product with predicted chemical and 
structural features reminiscent of Ecel would 
be an excellent candidate for a peptide toxin. 
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It remains unclear whether the seven Ece1 
peptides that are secreted along with Can- 
didalysin have a role in host interaction, or are 
just hitchhikers. Given that C. albicans spends 
most of its time as a commensal rather than a 
pathogenic fungus, it seems likely that Candid- 
alysin or the other Ece1 peptides have roles in 
maintaining normal fungus-host interactions. 
Finally, Ecel seems to be a promising drug tar- 
get — inhibitors of its production or activity 
might be effective antifungal agents. Such an 
advance would be welcome, because there is 
currently a dearth of antifungal drugs, and the 
rates of disease and mortality associated with 
C. albicans infections are high”. m 
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sea-level shifts 


An analysis of changes in island topography and climate that have occurred 
since the last glacial maximum 21,000 years ago shows how sea-level change 
has influenced the current biodiversity of oceanic islands. SEE LETTER P.99 


JOSE MARIA FERNANDEZ-PALACIOS 


ince the time of nineteenth-century 

naturalists such as Charles Darwin and 

Alfred Russel Wallace, islands have played 
a crucial part in the development of several 
scientific disciplines. Among these is the 
field of biogeography, which analyses the 
distributions of species and the historical, 
ecological and evolutionary processes behind 
them. In this issue, Weigelt et al.' (page 99) 
explore how the approximately 122-metre 
rise in sea level that occurred between the last 
glacial maximum (LGM), around 21,000 years 
ago, and the present day has contributed to the 
diversity of plant species seen on islands today. 

The equilibrium theory of island biogeo- 
graphy (ETIB) was proposed in 1967 (ref. 2). 
This postulated that the species richness of 
an island depends both on the island’s isola- 
tion, which controls the immigration rate of 
individual organisms from continents, and on 
its area, which controls the extinction rate of 
island species. This theory transformed bioge- 
ography froma mainly descriptive discipline to 
a quantitative and predictive one, and remains 
the most widely cited theory in the field. 

The predictive strength of the ETIB was 
improved by embellishments such as the res- 
cue effect’ and the target-area effect*, and we 
now know that the theory applies well to land- 
bridge islands — those that are separated from 
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the mainland by the current high sea level but 
are actually part of the continental shelf, such 
as Sumatra or Tasmania’. However, the pre- 
dictions of the ETIB fail for volcanic islands 
— those that emerged as a result of the accu- 
mulation of magma on the oceanic crust and 
were never part of a continent, such as Hawaii 
or the Canary Islands. It also fails to work 
for micro-continents — islands that became 
detached from continents after the formation 
of a new oceanic ridge, such as Madagascar or 
New Zealand. 

This failure is attributable mainly to the fact 
that the formation of new species (speciation) 
overrides immigration on volcanic islands 
and micro-continents*. Although these types 
of island may comprise less than 5% of existing 
islands, they contribute disproportionately to 
global biodiversity through their high num- 
bers of endemic (exclusive) species. 

In recent decades, there has been a shift 
from the idea of equilibrium in island biogeo- 
graphy to one of disequilibrium’. This thinking 
has been fuelled by the availability of increas- 
ingly complex and accurate bathymetric 
(underwater topography) charts, which have 
revealed extraordinary numbers of seamounts 
(underwater mountains) and guyots (table- 
shaped seamounts that were once islands but 
were later eroded and/or subsided beneath sea 
level). The general dynamic model* proposes 
that the different developmental stages of a 
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Figure 1 | Consequences of island dynamism. These hypothetical 
topographic configurations show how islands might have been affected by the 
122-metre rise in sea level that has occurred since the last glacial maximum 
(LGM) 21,000 years ago. a, The two islands currently above sea level have 

not experienced substantial transformation since the LGM, such that their 
area and isolation have not shifted significantly as a result of the sea-level rise. 


volcanic island control the biodiversity that it 
harbours. 

Bathymetric charts have also enriched our 
understanding of how glacial cycles during 
the Pleistocene epoch (from around 2.6 mil- 
lion to 11,500 years ago) altered global sea- 
level height, and of the effect of these sea-level 
changes on island dynamism. These cycles 
have influenced islands climates, and thus 
the altitudinal distribution of species and eco- 
systems. They have also contributed to changes 
in marine currents and wind regimes, which 
have affected connectivity, species sources and 
colonization rhythms’. Furthermore, glacial 
cycles affected the geographic configurations 
of islands, causing shifts in area, isolation and 
altitude, provoking island fissions and fusions, 
and inducing the emergence and submergence 
of seamounts”. 

However, although there have been some 
attempts to clarify the role that sea-level shifts 
have had in shaping the biota of some volcanic 
archipelagos’ ““, the real consequences of 
sea-level oscillations for current island 
biodiversity remain obscure. Weigelt and col- 
leagues have taken a key step towards filling 
this knowledge gap. 

The authors analysed 184 oceanic islands 
worldwide, including volcanic islands and 
micro-continents, to see how they have 
changed since the LGM. The researchers 
assessed island area, isolation and elevation, 
the number of islands that existed, and the 
temperature and precipitation on the islands 
during the LGM and today. They also meas- 
ured key components of the diversity of flow- 
ering plants (angiosperms), including the 
number of endemic and native species, and 
the proportion of endemics. 

Weigelt et al. found that the number and 


proportion of endemic angiosperm species 
found today is significantly higher on islands 
that were larger during the LGM than they 
are now (Fig. 1). However, they identified 
only weak effects of changes in temperature 
and precipitation on angiosperm diversity. 
The authors argue that islands that were 
repeatedly larger in the past (low-sea-level 
events occurred several times in the Pleis- 
tocene’’) might have been sites of within- 
island speciation that has left a strong imprint 
on endemism today. By contrast, species 
richness was mostly related to islands’ cur- 
rent environmental characteristics, sug- 
gesting an equilibrium with the present 
conditions. 

Contrary to the significant effect of change 
in island area on endemic species, Weigelt et al. 
found that shifts in island isolation due to sea- 
level rise had no effect on the richness of native 
or endemic species, nor on the proportion of 
endemics. They observe that the difference 
between current island isolation and that at the 
LGM is small, because most oceanic islands 
are quite isolated and, consequently, sea-level 
shifts have not significantly varied their degree 
of isolation. 

The authors conclude that considering 
environmental change during the Late 
Quaternary period (the past 130,000 years) 
is essential to understanding current patterns 
of island endemism and its underlying evolu- 
tionary dynamics. Furthermore, as the authors 
rightly point out, future research is needed on 
other, still poorly understood glaciation-asso- 
ciated shifts, for instance, in ocean currents, 
wind regimes and species source areas; on 
the emergence and disappearance of island- 
summit ecosystems and their ecological and 
evolutionary consequences; and on the part 
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b, The two current islands formed a single and much larger entity during the 
LGM owing to the existence of an island platform. Sea-level rise caused that 
single island to undergo a fission event, resulting in a loss of area and increased 
isolation. Weigelt et al.' find that although the species richness of native plants 
is similar in both scenarios, the percentage of endemic species is larger in b, 
reflecting the islands’ more complex and dynamic evolutionary history. 


that the reiterative emersion and submersion 
of seamounts has played in the past coloniza- 
tion of nearby islands. Such knowledge will 
improve our understanding of how current 
global warming will affect one of the planet's 
vital reservoirs of biodiversity — the oceanic 
islands. m 
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Water’s past revisited 
to predict its future 


Areconstruction of 1,200 years of water’s history in the Northern Hemisphere, 
based on proxy data, fuels the debate about whether anthropogenic climate 
change affected twentieth-century precipitation. SEE LETTER P.94 


MATTHEW E. KIRBY 


great challenges’. Climate models are 

key to these predictions, but such mod- 
els must be validated through comparison 
with, and integration of, measured or proxy- 
based climate data”. On page 94 of this issue, 
Ljungqvist et al.’ take on this challenge by 
reconstructing the past 12 centuries of water 
history in the Northern Hemi- 
sphere from proxy records. Their 
reconstruction for the twentieth 
century diverges sharply from 
climate-modelling results, and 
calls for greater development of 
water-sensitive proxy data and 
their integration into modelling 
efforts. 

Estimating future water avail- 
ability in a warming world 
is crucial, for various socio- 
economic and ecological rea- 
sons’. To compensate for the 
absence of measured data for 
future times, climate models are 
often assessed using twentieth- 
century data and, in many cases, 
inferred data from periods that 
pre-date climate measurements. 
Palaeoclimate data are derived 
from proxies of measured data, 
through the analysis of climate- 
sensitive archives such as ice 
cores, lake sediments and mineral 
deposits in caves (Fig. 1). 

Ljungqvist and colleagues 
undertook a formidable task in 
this area of research. They com- 
piled various water- and temper- 
ature-sensitive proxy data for 
the past 1,200 years, assessed age 
controls for the data, compared 
each data set across regions and 
continents, and then compared 
their proxy results with previ- 
ously reported climate-model 
results. Their reconstruction of 
hydroclimate is generally con- 
sistent with model results for the 
period concerned — for example, 
it indicates a higher percentage 
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of wetter-than-average land (relative to the 
average before the twentieth century) during 
the ninth to eleventh and the twentieth cen- 
turies, and the opposite during the twelfth to 
nineteenth centuries. Persistent precipitation 
dipoles (north-south patterns of precipitation 
anomalies that have opposing signs), such as 
that observed now in the western United 
States”, are also evident. However, the data 
do not capture the climate trends® known as 


Figure 1 | Calcite columns in Lechuguilla Cave, New Mexico. Cave mineral 
deposits can provide proxy data on past precipitation patterns. Ljungqvist et al.’ 
have reconstructed the water history of the Northern Hemisphere for the past 
1,200 years, based on proxies, including mineral deposits. 
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the Little Ice Age and the Medieval Climatic 
Anomaly. 

Perhaps the most notable finding is that the 
proxy-based reconstruction diverges from 
model outputs for the twentieth century. Spe- 
cifically, the proxy results indicate no difference 
in twentieth-century water dynamics compared 
with the pre-industrial era, whereas model 
results indicate the development of a substantial 
bimodal distribution in precipitation between 
extreme wetness and extreme dryness — dry 
gets drier and wet gets wetter (DDWW). 

The DDWW idea is not new’. This 
concept is based largely on the expectation 
that higher global temperatures will intensify 
the water cycle, increasing the average rainfall 
and/or rainfall intensity, as well as amplifying 
net evaporation. This intensification is related 
to the Clausius—Clapeyron equation’, which 
states that moisture content in the lowest 
part of the atmosphere will rise with higher 
temperatures. Already-wet regions might 
therefore receive more-frequent and/or more- 
intense precipitation. Conversely, dry regions 
will become drier as warmer 
temperatures expand the sub- 
tropical dry zones and inten- 
sify areas of high pressure (low 
precipitation). But this picture 
is an oversimplification and 
effects are likely to manifest in 
unforeseen ways’. 

Whether twentieth-century 
water-cycle intensification 
and the associated DDWW 
response have been detected 
is still controversial’. The 
divergence of Ljungqvist and 
colleagues’ hydroclimate 
reconstruction from model 
results for the twentieth cen- 
tury certainly adds fuel to the 
fiery debate. Do their results 
invalidate current predictive 
models? Certainly not. But 
they do highlight a big chal- 
lenge for climate modellers, 
and present major research 
opportunities both for model- 
lers and for climate scientists 
who work with proxy data’. 

For example, the authors 
highlight the continued 
importance of combining 
proxy-based results with cli- 
mate models. This is more eas- 
ily said than done. Not every 
site contains the same types 
of proxy, and the response of 
sources of proxy data to climate 
can change over time. More- 
over, the climatic sensitivity of 
proxy-data sources varies from 
type to type and site to site. 

To address these limitations, 
palaeoclimate scientists must 
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standardize best practices for proxy sampling, 
reconstructions, dating and statistical analyses. 
One priority should be to select key sites for 
proxy sampling and analysis. For example, are 
there crucial geographical gaps that require 
attention and that strike an urgent socio- 
economic or ecological chord*"’? Fortunately, 
several initiatives are already focusing on 
improving efforts to integrate models and 
proxy results, for example the Paleoclimate 
Modelling Intercomparison Project and the 
Past Global Changes project. 

Ljungqvist et al. were, of course, constrained 
by the data available for analysis — indeed, 
their efforts reveal a shocking lack of data. For 
example, Figure 1 of their paper’ highlights the 
vast geographical gaps between proxy sites. 
Immense areas of the Northern Hemisphere 
still require exploration for proxy develop- 
ment, many in highly populated regions. The 
current analysis should therefore be revisited 


as proxy records from these regions become 
available. 

Nevertheless, this research is a crucial first 
step in the use of models and proxy data to 
reconstruct and explain the history of water 
— and not just of temperature change — 
through time. Future research efforts should 
improve, test and extend Ljungqvist and col- 
leagues’ results. Global warming will undoubt- 
edly change Earth's water cycle, so the more 
that is known about the cycle’s past behaviour 
through proxy records, and the better those 
changes can be modelled, the more confidence 
we will have in predictive models as we forge 
ahead into the twenty-first century. m 
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Untangling autism 


Aclever dissection of the roles of the Ptchd1 gene in the brains of mice demonstrates 
one way to untangle the complex relationships between the causes and symptoms 
of neurodevelopmental disorders. SEE ARTICLE P.58 


SCOTT BOLKAN & JOSHUA A. GORDON 


ur current understanding of neuro- 

developmental disorders can be 

thought of as a tangled mess of 
threads. The clinical presentations of such dis- 
orders are so variable that people can receive 
the same diagnosis despite not sharing a single 
symptom. Furthermore, the underlying risk 
factors — be they genetic or environmental 
—can be associated with not just one, but sev- 
eral disorders. Untangling these threads to link 
cause and outcome seems a Sisyphean task, but 
it is a crucial one if we are to develop treatment 
approaches that have a solid neurobiological 
basis. In this issue, Wells et al.’ (page 58) use 
a mouse model of a human genetic condition 
to grasp the loose end of a single thread and 
carefully extricate it from the tangled mess. In 
doing so, they map a precise set of genetically 
linked symptoms onto dysfunction of a neu- 
ronal structure that gates the flow of informa- 
tion across multiple brain circuits. 

Wells and colleagues’ approach is based 
on clinical observations” of a link between 
autism spectrum disorder and mutations 
in the PTCHD1 gene. Autism spectrum 
disorder involves a tremendously debilitating 
disruption of social and cognitive function, 
and is frequently associated with several 
neurodevelopmental diagnoses that have 
overlapping symptoms, including intellectual 


disability and attention deficit hyperactivity 
disorder. 

As is typical for genes associated with 
neurodevelopmental conditions, PTCHD1 
mutations are seen in only a small fraction of 
people with autism spectrum disorder*. Not 
everyone with the risk-associated gene variant 
develops the disorder, but the mutation does 
raise disease risk substantially: more than 
40% of individuals with the mutation develop 
autism-like behaviours, compared with about 
1% of the general population’. Nonetheless, 
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how a single gene can alter brain function to 
produce particular symptoms remains unclear. 

Wells et al. began to unravel these issues 
by generating mice that produce a truncated, 
non-functional form of the PTCHD1 protein. 
The authors report that Ptchd1 mutant mice 
exhibit behavioural abnormalities that are 
largely consistent with those seen in people 
with autism spectrum disorder, including sleep 
disruption, hyper-aggression and deficits in 
attention and learning. 

In seeking to understand how the Ptchd1 
mutation can produce such a broad set of 
symptoms, the authors were struck by the 
observation that, during early postnatal 
development, expression of the gene is highly 
enriched in a neuronal structure called the 
thalamic reticular nucleus (TRN) in the thala- 
mus region of the brain. This structure sends 
inhibitory neuronal projections to all areas of 
the thalamus, and thus exerts powerful control 
over the flow of neuronal activity within and 
across brain circuits that process information 


a b 
1 Sleep PTCHD1 Sleep 
TRN 
2 Attention Attention | 
3 Aggression Aggression 
Other 
control 
a Learning Learning | 


Figure 1 | Tracing the threads between causes and symptoms of autism. a, Autism spectrum disorder 
is caused by a range of environmental and genetic factors (symbolized here by numbers), each of which 
can lead to one or more of many symptoms. Untangling which causes lead to which symptoms is a major 
challenge. b, Wells et al.' have done just that for the PTCHD1 protein, which is mutated in some people 
with autism spectrum disorder. They find that, in mice, PTCHD1 regulates sleep, attention, aggression 
and learning. Furthermore, sleep and attention, but not aggression and learning, are mediated by 
PTCHD1 expression in a neuronal structure called the thalamic reticular nucleus (TRN). 


© 2016 Macmillan Publishers Limited. All rights reserved 


7 APRIL 2016 | VOL 532 | NATURE | 45 


| RESEARCH | NEWS & VIEWS 


about vision, movement, cognition and more. 
The TRN is known as the ‘guardiar’ or ‘gate- 
keeper’ of activity in these thalamic brain 
circuits and has regulatory roles in sleep® and 
attentional processes”®, As such, dysfunction 
in the TRN could plausibly produce wide- 
ranging behavioural abnormalities. 

Focusing on the TRN, Wells et al. traced a 
route from gene to behaviour in Ptchd1 mutant 
mice. The authors monitored the activity of 
TRN neurons, and found fewer bursts of activ- 
ity in mutant animals than in controls. These 
changes were attributable to a drastic reduction 
in the activity of the SK-channel protein, which 
mediates passage of potassium ions across the 
cell membrane and normally promotes activity 
bursts. With reduced levels of channel activ- 
ity, and reduced numbers of bursts, TRN neu- 
rons fail to properly inhibit activity in other 
regions of the thalamus, including the lateral 
geniculate nucleus — the main visual relay 
between the eyes and the brain's visual cortex. 
This failure to properly inhibit thalamic relays 
seems to disrupt visual attention, among other 
behaviours. 

Which of the wide-ranging abnormalities 
seen in the Ptchd1 mutant mice truly depend 
on this disruption of TRN function? To answer 
this question, Wells et al. generated mice that 
lacked the Ptchd1 gene exclusively in the TRN. 
In a remarkable dissociation of symptoms, 
they found that TRN-restricted Ptchd1 mutant 
mice display hyperactivity and deficits in sleep 
and attention, but do not exhibit the learn- 
ing deficits and hyper-aggression observed 
in mice carrying the brain-wide mutation 
(Fig. 1). Moreover, using a pharmacological 
agent to restore SK-channel function in adult 
mice, Wells et al. were able to re-establish activ- 
ity levels in the thalamic relays controlled by 
the TRN. This reversed abnormal hyper- 
activity and deficits in sleep and attention, but 
not impaired learning or hyper-aggression. 
Successful restoration of normal sleep and 
attention in adult animals raises hopes that 
treatments targeting the SK channel might 
benefit people with PTCHD1 mutations, and 
perhaps also those with autism spectrum dis- 
order caused by other factors. 

The authors’ study indicates that both 
TRN-specific deletion of PTCHD1 and sys- 
temic enhancements in SK-channel function 
affect the same subset of behavioural out- 
comes. This consistency strongly supports the 
idea that PTCHD1 exerts its effects on TRN 
function (and therefore on activity, sleep 
and attention) through the SK channel. 
However, the findings also argue that the 
effects of PTCHD1 on learning and aggres- 
sion are caused by an altogether different 
mechanism that both acts outside the TRN 
and is independent of SK-channel func- 
tion. Determining which brain regions and 
biochemical pathways might be responsible 
for these behavioural changes will probably 
be important for fully reversing the deficits 
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caused by PTCHD1 mutations. 

Wells et al. have placed their fingertips on 
the loose thread of PTCHD1 expression, fol- 
lowing it to the TRN, through the SK channel 
and to its end in behaviours associated with 
neurodevelopmental disease. In doing so, they 
have identified the SK channel as a potential 
pharmacological target for treating autism 
spectrum disorder, while simultaneously 
highlighting the usefulness of animal mod- 
els of human genetic conditions for studying 
neurodevelopmental and psychiatric diseases. 
Although the role of PTCHD1 in learning and 
aggression remains elusive, the current paper 
shows one way to navigate the daunting com- 
plexity of neuronal-circuit function to reveal 
the mechanisms by which gene dysfunction 
leads to changes in behaviour. m 
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Breaks in the brain 


A high-throughput approach has found clusters of DNA double-strand breaks in 
neural cells. Most of the clusters are in large genes that are associated with neural 
function, which suggests that the breaks may have tissue-specific roles. 


THOMAS W. GLOVER & THOMAS E. WILSON 


a surprising degree of structural variation 

in human genomes. Structural variants 
(SVs) in germline DNA are now known to 
be a major factor in normal genomic vari- 
ation and an important class of mutation in 
genomic disorders, and they arise frequently 
in cancers. Many SVs are thought to result 
from DNA-replication errors’, so they would 
be expected to occur at a high frequency 
in dividing somatic cells (those that do not 
undergo meiotic division). However, in con- 
trast to heritable SVs in germline cells, little 
is known about somatic SVs and their impact 
on tissue function and disease. Writing in Cell, 
Wei et al.’ provide insight into these questions 
by examining the landscape of DNA double- 
strand breaks (DSBs) that arise in mouse 
neural cells. 

The difficulty in studying somatic SVs 
arises primarily from technical challenges 
in detecting rare events in cell populations. 
Sensitive sequencing technologies coupled 
with bioinformatic tools have begun to pro- 
vide glimpses of somatic SVs in vivo, especially 
in the brain, where up to 40% of individual 
neurons have been found to contain megabase- 
scale copy-number variations (CNVs; a form 
of SV in which the number of copies of a 
genomic region varies between cells or individ- 
uals)**. However, because of the low resolution 
of many approaches, the true prevalence of SVs 
in neurons or other cells, and the mechanisms 


tudies over the past decade have revealed 
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by which they arise, are unknown. 

Wei et al. used a sensitive, targeted assay 
known as high-throughput genomic trans- 
location sequencing. This method allows 
genome-wide detection of naturally occur- 
ring ‘prey’ DSBs using experimentally induced 
‘bait’ DSBs targeted elsewhere in the genome. 
The two DSBs are joined by cellular DNA- 
repair processes, leading to a translocation 
between the genomic regions. Sequencing 
of the resulting breakpoint junction allows 
mapping and characterization of the DSBs at 
nucleotide-level resolution. 

The authors first created DSBs at bait loci 
on three mouse chromosomes in cultured 
neural stem/progenitor cells (NSPCs) that 
lacked the protein Xrcc4, which is essential for 
a DSB-repair process called non-homologous 
end-joining (NHEJ); preventing this joining 
enriches for cells with rearrangements. The 
cells also lacked the protein p53; this lack pro- 
motes cell survival. The researchers identified 
thousands of prey DSBs, with 61% located 
close to the bait DSBs and the rest spread 
across the genome (Fig. 1). Strikingly, many 
were found within three recurrent DSB clusters 
(RDCs); two of these were in the Lsamp and 
Npsa3 genes, which are unusually large genes 
that encode a neural-cell-specific adhesion 
molecule and a transcription factor, respec- 
tively. The third RDC reflected DSBs that 
occurred close to the bait. 

When the authors applied the same pro- 
cedure in mouse B cells (a type of immune 
cell), they found B-cell-specific RDCs but no 


Developing 
brain 


Cultured NSPCs 


Fetal mouse 


Large, actively transcribed, 
late-replicating gene 


Chromosome 
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Translocations 
g Other rearrangements? 


Gain or loss of function? 
Functional consequences? 


Figure 1 | Translocation hotspots. Wei et al.” performed an unbiased genome-wide screen for DNA double-strand breaks (DSBs) in cultured mouse neural 
stem/progenitor cells (NSPCs). Their assay identifies prey DSBs that have joined with experimentally induced bait DSBs to generate inter-chromosomal 
translocations. The researchers found hotspots for recurrent DSBs in large, actively transcribed, late-replicating genes that function in synapses between neural 
cells or in neural adhesion. Many more recurrent DSB clusters were identified when the cells were subjected to replication stress. These DSBs could lead to 
genomic rearrangements that contribute to genetic heterogeneity in different cells of the brain or other tissues, giving rise to functional consequences in vivo. 


translocations involving Lsamp. Transcription 
analysis showed Lsamp expression in NSPCs 
but not B cells, suggesting that the suscepti- 
bility of certain genetic regions to DSBs in 
certain cell types is mechanistically linked to 
transcription. 

The authors then examined the impact of 
DNA-replication stress on DSBs by treating 
NSPCs with low doses of aphidicolin, an inhib- 
itor of DNA polymerase enzymes that is widely 
used to study common fragile site (CFS) insta- 
bility’. This approach identified 24 additional 
RDCs, all in genes, and more than 300 candi- 
date RDCs. In total, 26 of the 27 RDCs were in 
genes larger than 400 kilobases, and 13 were 
in genes larger than 1 megabase, including 
many CFS genes. More than 90% (24 out of 
26) of the RDC genes in NSPCs are involved 
in neural adhesion or the function of synapses 
(connections between neurons), and many 
are implicated in neurodevelopmental and 
neuropsychiatric disorders, including Lsamp, 
Nrnx1 and Dec. All but one of the RDC genes 
were actively transcribed, and all but one com- 
pleted replication late in the S phase of the cell 
cycle. Comparing the sequences at break- 
point junctions between NHEJ-proficient 
and NHEJ-deficient cells indicated that 
both canonical and alternative end-joining 
pathways mediate the translocations. 

These findings of large gene size, active 
transcription, late replication and induction 
by replication stress strongly suggest that inter- 
ference between transcription and replication 
contributes to the high frequency of DSBs 
and genomic rearrangements in NSPCs. An 
earlier study” linked these same factors to 
CNVs and CFSs in human fibroblast cells and 
mouse embryonic stem cells. The study found 
both spontaneous and replication-stress- 
induced CNVs of sizes ranging from 1 kilobase 
to several megabases across the genome, but 
with ‘hotspots’ in large, late-replicating, actively 
transcribed genes, including LSAMP in human 
fibroblasts and other genes with neural func- 
tions. These hotspots were also CFSs — regions 


in which gene structure and slowed or stalled 
replication led to the passage of unreplicated 
DNA into mitotic cell division®” 

Strikingly, hotspot CNVs clustered in the 
centre of large genes ina pattern similar to the 
distribution of RDC breakpoints identified by 
Wei and colleagues. This instability was again 
evident only when the genes were transcribed. 
There was also considerable overlap in the 
genes identified in these two studies, with 5 out 
of 9 (56%) of the human CNV hotspots corre- 
sponding to RDC genes and 52% of the RDC 
genes showing CNVs in mouse embryonic 
stem cells. A key insight from Wei and col- 
leagues’ results is that instability at these sites is 
associated with DSBs. In addition to giving rise 
to translocations, the DSBS detected by Wei 
et al., which probably arise from stalled repli- 
cation forks that form during DNA synthesis, 
might also lead to CNVs through end-joining 
or other DNA-repair pathways. Understand- 
ing the disposition of transcription-dependent 
DNA lesions between different repair pathways 
and their coordination in the cell cycle are 
challenges for future studies. 

In addition to mechanistic insights, Wei and 
colleagues’ findings have implications for our 
understanding of genome stability in the brain 
and other tissues. It has been speculated that 
somatic genomic rearrangements contribute 
to neuronal diversity, learning and memory’, 
processes in which many large RDC-associated 
genes function. Studies of SVs in placenta, skin 
and other tissues have also suggested essential 
roles in cell biology”"®. The research group 
behind the current study previously showed" 
that NHEJ is essential for NSPC differentia- 
tion, which implies a role for NHEJ-mediated 
DSB repair during normal brain development. 
It is possible that subpopulations of cells in 
the brain display the same types of rearrange- 
ments, and that some are positively selected. 
Similar studies in vivo and in other tissues are 
needed to test this hypothesis. 

The fact that many of the largest mamma- 
lian genes are transcribed in the brain and have 
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neuronal functions supports the potential for 
brain-specific SVs. It is enticing to speculate 
that programmed SV formation could explain 
why large gene structure is evolutionarily 
conserved despite being potentially unstable. 
However, an alternative explanation is length- 
dependent fine-tuning of gene expression 
through protein binding to brain-specific 
DNA modifications’”. We also need to under- 
stand what other factors contribute to DSBs 
and SV formation at large transcribed genes 
and why some are more susceptible than 
others. Finally, knowledge of the full spec- 
trum of SVs found in the brain and other tis- 
sues will be required to discern their potential 
consequences. Many de novo SVs are likely to 
be deleterious, but variants that lead to gained 
or altered functions would be of immense 
interest. m 
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Climate-smart soils 


Keith Paustian!?, Johannes Lehmann’, Stephen Ogle**, David Reay°, G. Philip Robertson® & Pete Smith’ 


Soils are integral to the function of all terrestrial ecosystems and to food and fibre production. An overlooked aspect 
of soils is their potential to mitigate greenhouse gas emissions. Although proven practices exist, the implementation of 
soil-based greenhouse gas mitigation activities are at an early stage and accurately quantifying emissions and reductions 
remains a substantial challenge. Emerging research and information technology developments provide the potential for 
a broader inclusion of soils in greenhouse gas policies. Here we highlight ‘state of the art’ soil greenhouse gas research, 
summarize mitigation practices and potentials, identify gaps in data and understanding and suggest ways to close such 


gaps through new research, technology and collaboration. 


vidence points to agriculture as the first instance of human- 

caused increases in greenhouse gases (GHGs), several thousand 

years ago’. Agriculture and associated land-use change remain a 
source for all three major biogenic GHGs: carbon dioxide (COz), meth- 
ane (CHy), and nitrous oxide (NO). Land use contributes ~25% of total 
global anthropogenic GHG emissions: 10%-14% directly from agricul- 
tural production, mainly via GHG emissions from soils and livestock 
management, and another 12%-17% from land cover change, including 
deforestation”*. Although soils contribute a major share (37%; mainly 
as N,O and CH,) of agricultural emissions’, improved soil management 
can substantially reduce these emissions and sequester some of the CO, 
removed from the atmosphere by plants, as carbon (C) in soil organic 
matter (in this Perspective, our discussion of soil C refers solely to 
organic C). In addition to decreasing GHG emissions and sequestering C, 
wise soil management that increases organic matter and tightens the 
soil nitrogen (N) cycle can yield powerful synergies, such as enhanced 
fertility and productivity, increased soil biodiversity, reduced erosion, 
runoff and water pollution, and can help buffer crop and pasture systems 
against the impacts of climate change’. 

The inclusion of soil-centric mitigation projects within GHG offset 
markets®® and new initiatives to market ‘low-carbon’ products’ indicate 
a growing role for agricultural GHG mitigation®. Moreover, interest in 
developing aggressive soil C sequestration strategies has been height- 
ened by recent assessments, which project that substantial terrestrial 
C sinks will be needed to supplement large cuts in GHG emissions to 
achieve GHG stabilization levels of 450 parts per million CO, equivalent 
or below, consistent with the goal of a mean global temperature increase 
of less than 2 °C (ref. 9). Soil C sequestration is one of a few strategies 
that could be applied at large scales? and potentially at low cost; as an 
example, the French government has proposed!” to increase soil C 
concentration in a large portion of agricultural soils globally, by 0.4% 
per year, in conjunction with the Conference of the Parties to the UN 
Framework Convention on Climate Change (UNFCCC) negotiations in 
December 2015. This would produce a C sink increase of 1.2 petagrams 
(Pg) of C per year (ref. 10). 

An extensive body of field, laboratory and modelling research over 
many decades demonstrates that improved land use and management 
practices can reduce soil GHG emissions and increase soil C stocks. 
However, implementing effective soil-based GHG mitigation strate- 
gies on a large scale will require the capacity to measure and monitor 
GHG reductions with acceptable accuracy, quantifiable uncertainty and 


at relatively low cost. Targeted research to improve predictive models, 
expanded observational networks to support model validation and 
uncertainty bounds, ‘Big Data approaches to integrate land use, man- 
agement and environmental drivers, and technologies with which to 
engage actively with land users at the grass-roots level, are all key ele- 
ments in realizing the potential GHG mitigation from climate-smart 
agricultural soils. 


Process controls and mitigation practices 
Soil C sequestration via improved management 
Soils constitute the largest terrestrial organic C pool (~1,500 PgC toa 
depth of 1m; 2,400 PgC to 2m depth"’), which is three times the amount 
of CO; currently in the atmosphere (~830 Pg C) and 240 times the current 
annual fossil fuel emissions (~10 Pg)*. Thus, increasing net soil C storage 
by even a few per cent represents a substantial C sink potential. 
Proximal controls on the soil C balance include the rate of C addition 
as plant residue, manure or other organic waste, minus the rate of C loss 
(via decomposition). Hence, C stocks can be increased by increasing 
organic matter inputs or by reducing decomposition rates (for example, 
by reducing soil disturbance), or both, leading to net removal of C from 
the atmosphere’”. However, soil C accrual rates decrease over time as 
stocks approach a new equilibrium. Therefore net CO, removals are of 
limited duration, often attenuating after two to three decades!}, 
Unmanaged forests and grasslands typically allocate a large fraction 
of their biomass production belowground and their soils are relatively 
undisturbed; accordingly, native ecosystems usually support much higher 
soil C stocks than their agricultural counterparts, and soil C loss (typically 
0.5 to >2 megagrams (Mg) of C per hectare per year) following land 
conversion to cropland has been extensively documented'*!°. Total losses 
once the soil approaches a new equilibrium are typically ~30%-50% 
of topsoil (for example, 0-30cm depth) C stocks’*. Hence, avoiding 
conversion and degradation of native ecosystems is a strong mitigation 
alternative. Conversely, restoration of marginal or degraded lands to 
perennial forest or grassland increases soil C storage (Fig. 1), although 
usually at a slower rate than the original conversion losses'*!”. Restoring 
wetlands that have been drained for agricultural use reduces ongoing 
decomposition losses, which can be as high as 5-20 MgCha7'yr7! 
(ref. 18), and can also restore C sequestration (Fig. 1), though methane 
emissions may increase!®°, Land-use conversions may, however, conflict 
with agricultural production and food security objectives, entailing the 
need for a broad-based accounting of net GHG implications (Table 1)”. 
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University, Fort Collins, Colorado, USA. °School of Geosciences, University of Edinburgh, Edinburgh, UK. ®W. K. Kellogg Biological Station and Department of Plant, Soil and Microbial Sciences, 
Michigan State University, East Lansing, Michigan, USA. “Institute of Biological and Environmental Sciences, University of Aberdeen, Aberdeen, UK. 


7 APRIL 2016 | VOL 532 | NATURE | 49 


© 2016 Macmillan Publishers Limited. All rights reserved 


| RESEARCH | PERSPECTIVE 


Baseline condition 


Mitigation practices 


Degraded or (i) Convert to perennial 


A —_—————_—_> : 
marginal land? vegetation 
eee eile —— (ii) Restore to wetland 
organic (histosol) soils? 
Severe nutrient (iii) Add nutrients; add lime; 
SUE aol 


deficiency? 


y co 


5 (iv) Grow cover crops; reduce 
or vegetate fallow fields 


grow N-fixing species 


Extensive bare fallow 


y ~—— 


(v) Reduce to economic- 


Excess N fertilizer use? =3§=————————> ; 
optimal rates 
inteneivertilisde?, (vi) Reduce or halt tilling; 
9 implement residue retention 
Suboptimal N (vii) Improve timing and 
management? > _ placement; use enhanced- 


efficiency fertilizer 


y a 


(viii) Rotate perennials; use 
—— agroforestry; use high-C input 
species; grow cover crops 


Low residue crops? 


y 


Available exogenous 
organic amendments? 


———— 


(ix) Add amendments such as 
compost and biochar 


oe 


\ No — > Yes ——> Practices implemented 


Figure 1 | Decision tree for cropland GHG mitigating practices. 

(Rice is not included.) For degraded, marginal lands, the most productive 
mitigation option is conversion to perennial vegetation either left 
unmanaged or sustainably harvested to offset fossil energy use (cellulosic 
biofuels). Histosol is soil with very high organic matter content, such 

as from peat bog. For more arable lands, multiple options could be 
implemented sequentially or in combination, depending on management 
objectives, cost and other constraints (see Table 1). The practices shown 
(see Table 1 and text for more discussion) are roughly ordered from 
lower-cost or higher-feasibility options towards more costly interventions 
(bottom of figure). 


In general, soil C sequestration rates on land maintained in agri- 
cultural use are lower than for land restoration to grassland or forest, 
and vary on the order of 0.1-1 Mg Cha“ yr~’, as a function of land- 
use history, soil or climate conditions, and the combination of man- 
agement practices applied”. Practices that increase C inputs include 
(1) improved varieties or species with greater root mass to deposit C in 
deeper layers where turnover is slower’, (2) adopting crop rotations that 
provide greater C inputs”, (3) more residue retention”, and (4) cover 
crops during fallow periods to provide year-round C inputs (Fig. 1)7>”°. 
Cover crops can also reduce nutrient losses, including nitrate that is oth- 
erwise converted to N2O in riparian areas and waterways”°—an example 
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of synergy between practices that sequester C and also tighten the N cycle 
to limit emissions of N,O (Table 1). Other practices to increase C inputs 
include irrigation in water-limited systems’? and additional fertilizer 
input to increase productivity in low-yielding, nutrient-deficient systems 
(Fig. 1)”. Although additional nutrient and water inputs to boost yields 
may increase non-CO, emissions’®, the emissions intensity of the 
system (GHG emissions per unit yield) may decline, providing a global 
benefit if the yield increase avoids land conversion for agriculture 
elsewhere”?, 

Many croplands can sequester C through less intensive tillage, particu- 
larly zero tillage’, owing to less disruption of soil aggregate structure”. 
Some authors have argued that benefits are small because increased C 
content in surface horizons are offset by C losses deeper in the profile*®, 
although others have noted that the larger variability in sub-surface 
horizons and lack of statistical power in existing studies makes such 
conclusions questionable*!. 

A change from annual to perennial crops typically increases below- 
ground C inputs (and soil disturbance is reduced), leading to C 
sequestration’®. In grasslands, soil C sequestration can be increased 
through optimal stocking/grazing density. Improved management in 
fire-prone ecosystems via fire prevention or prescribed burning can also 
increase C sequestration”. 

Key knowledge gaps that affect our understanding of soil C seques- 
tration processes and management options to implement them include 
questions about the differential temperature sensitivities of C turnover 
among soil organic matter fractions*, interactions among organic matter 
chemistry, mineral surface interactions and C saturation®”°”, and subsoil 
(>30cm) soil organic matter accretion, turnover and stabilization*®. 
Landscape processes, particularly the impact of erosion and lateral 
transport of C in sediments, contribute additional uncertainty on net 
sequestration occurring at a specific location”. And emerging evidence 
that stabilized soil organic matter is of microbial rather than direct plant 
origin? may offer a potential to manipulate the soil-plant microbiome 
to enhance C sequestration in the rhizosphere. 


Soil C sequestration via exogenous C inputs 

Addition of plant-derived C from external (that is, offsite) sources such 
as composts or biochar can increase soil C stocks, and may result in net 
CO, removals from the atmosphere (Fig. 1). Both compost and biochar 
are more slowly decomposed compared to fresh plant residues, with com- 
posts typically having mean residence times several times greater than 
un-composted organic matter*!, and biochar mineralizes 10-100 times 
more slowly than uncharred biomass”. Thus a large fraction of added C— 
particularly for biochar—can be retained in the soil over several decades 
or longer, although residence times vary depending on the amendment 
type, nutrient content and soil conditions*® (such as moisture, tempera- 
ture and texture). 

However, because the organic matter originates from outside the eco- 
system ‘boundary; a broader life-cycle assessment approach is needed, 
that considers the GHG impacts of: (1) offsite biomass removal, trans- 
port, and processing; (2) alternative end uses of the biomass; (3) inter- 
actions with other soil GHG-producing processes; and (4) synergies 
between these soil amendments and the fixation and retention of in situ 
plant-derived C8, In many cases, net life-cycle emissions will largely 
depend on whether the biomass used as a soil amendment would have 
otherwise been burnt (either for fuel, thereby offsetting fossil fuel use, 
or as waste disposal), added to a landfill, or left in place as living biomass 
or detritus*4. 

While slower mineralization of the amendment is an important 
determinant of net mitigation impact, effects on other soil emissions 
cannot be neglected. Mineralization of existing soil C in response to 
amendments (often referred to as ‘priming”*») has often been observed 
immediately following biochar addition, but priming usually declines, 
sometimes becoming negative (that is, inhibiting in situ soil C decompo- 
sition), over time**”, Analogous time dependence of soil NO and CH, 
emissions has not received sufficient attention*!. Increased plant growth 
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Table 1 | Co-benefits, relative costs and constraints for the mitigation practices shown in Fig. 1 


Mitigation Practice Relative cost Constraints and caveats 
practices co-benefits 
Developed Less developed 
(i) Soil erosion $$ $$ Alternate land/livelihood for subsistence farmers; opportunity cost of removing land 
Biodiversity base; potential for leakage (that is, land-use change impact) 
Water quality 
(ii) Biodiversity $$$ $$$ High opportunity cost of lost crop production; potential increase in CH4 emissions; 
Water quality potential for leakage (that is, land-use change impact) 
(iii) Food security $ $$ Availability or access to fertilizer; potential increase in N20 emissions 
Water quality 
(iv) Soil erosion $ $$ Limited applicability in dry areas 
Water quality 
Soil health 
Food security 
(v) Water quality $ $ Risk of crop production loss 
(vi) Soil erosion $ $$ Limited applicability in cold climates; potential increased equipment cost; increased 
Water quality herbicide use 
Soil health 
(vii) Water quality $$ $$$ Availability or access to enhanced efficiency fertilizer 
(viii) Biodiversity $$$ $$ Less applicability in dry areas and shallow soils; potential opportunity costs of lost crop 
Water quality production 
Soil health 
(ix) Soil health $$$ $$ Dependent on life-cycle emissions of producing the amendment 
Food security 


Mitigation practices are numbered according to Fig. 1. Co-benefits include non-GHG ecosystem services from practice implementation. Relative costs are provided as examples based on a developed 
region such as North America and a less developed region such as sub-Saharan Africa; however, a specific option in one region may have a higher cost or be a less feasible option in another region. 
Potential constraints include factors that might limit or preclude practice adoption or increase other GHG emissions as a consequence of practice adoption. All options require a region-specific full-cost 
carbon accounting (GHG life-cycle analysis) that considers potential indirect land-use effects in order to define specific mitigation potentials. 


in amended soils and the resultant feedbacks to soil C can make up a 
large proportion of the soil-based GHG balance*** and these feedbacks 
may be especially important for more persistent amendments, because 
of the longer duration of any effects. 


Soil management to reduce N2O emissions 

Arable soils emit more N2O to the atmosphere than any other anthropo- 
genic source”; some 4.2 teragrams (Tg) ofa global anthropogenic flux of 
8.1TgN,O-N yr‘. Reducing this flux represents a substantial mitigation 
opportunity, particularly since N2O is often the major source of radiative 
forcing in intensively managed cropland. Better N management to reduce 
emissions would also ameliorate other environmental problems such as 
nitrate pollution of ground and surface waters caused by excess reactive 
N in agroecosystems (Fig. 1, Table 1). 

N20 is produced in soils by microbial activity—mainly nitrification 
and denitrification—which occurs readily when stimulated by the abun- 
dant N that cycles rapidly in virtually all agroecosystems. During nitri- 
fication, ammonium added as fertilizer, fixed from the atmosphere by 
legumes, or mineralized from soil organic matter, crop residue, or other 
inputs is oxidized to nitrite and eventually to nitrate in a series of reac- 
tions that can also produce N,O. Likewise, when denitrifiers use nitrate 
as an electron acceptor when soil oxygen is low, NO is an intermediate 
product that can readily escape to the atmosphere. 

Arable soils managed to support high crop productivity have the capac- 
ity to produce large quantities of N2O, and fluxes are directly related to 
N inputs. On average, about 1% of the N applied to cropland is directly 
emitted as N20 (ref. 49), which is the basis for estimating emissions 
using default Intergovernmental Panel on Climate Change methods". 
However, recent evidence suggests that this value is too high for crops that 
are under-fertilized and too low for crops that are fertilized liberally”*. 
When crops compete with microbes for available N, N2O fluxes are 
lower. In addition to direct in-field emissions, high N applications cause 
N losses from leaching and volatilization that contribute to ‘indirect’ N,O 
emissions, downstream or downwind from the field®”. 

Since N20 has no significant terrestrial sink, abatement is best 
achieved by attenuating known sources of NO emissions, by altering the 


environmental factors that affect N2O production (soil N, oxygen, and C) 
or by biochemically inhibiting conversion pathways using soil additives. 
For example, nitrification can be inhibited with commercial additives 
such as nitrapyrin and dicyandiamide, which slow ammonium oxidation, 
and field experiments suggest that inhibitors can reduce NO fluxes by 
up to 40% in some soils, although other soils show little reduction and 
more research is needed to understand variable site-level responses*!. 
Likewise, tillage and water management can affect NO fluxes by altering 
the soil microenvironment”. 

Another means of reducing N2O emissions from arable soils is more 
precise N management to minimize excess N not used by the crop, while 
maintaining sustainable high yields. Fertilized crops typically take up less 
than 50% of the N applied; the remainder is available for loss. According 
to one recent study™4, corn farmers in the midwestern states of the USA 
could reduce N20 loss by 50% with more conservative fertilizer practices. 
Nitrogen conservation can be achieved by: (1) better matching applica- 
tion rates of N to crop needs using advanced statistical and quantitative 
modelling; (2) applying fertilizer at variable rates across a field based 
on natural patterns of soil fertility, or within the root zone rather than 
broadcast on the soil surface; and (3) applying fertilizer close to when the 
crop can use it, such as several weeks after planting, or adding it earlier 
but using slow-release coatings to delay its dissolution (Fig. 1, Table 1)*°. 

High temporal and spatial variability make predictions of changes in 
NO fluxes in response to management surprisingly difficult. Particularly 
lacking are empirical data for multi-intervention strategies that may 
interact in unexpected ways. Aligned to this paucity are gaps in our 
understanding of how N cycling and net N2O flux in managed soils will 
respond to future climate change*’. The limited number of field manip- 
ulation studies to date indicate that changing temperature and precipi- 
tation patterns may have large and strongly coupled effects on net N20 
emissions*, yet our understanding of the processes that underpin these 
effects and their robust representation in models is far from complete. 


Soil management to reduce CH, emissions 
More than one-third (>200 Tg yr !)? of global methane (CH,) emis- 


sions occur through the microbial breakdown of organic compounds in 
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Figure 2 | Global potential for agricultural-based GHG mitigation 
practices. Management categories are arranged according to average 

per hectare net GHG reduction rates and potential area (in millions of 
hectares) of adoption (note log-scales). Unless otherwise noted, estimates 
are from ref. 19, based on cropland and grassland area projections 

for 2030. Ranges given in units of total Pg CO(eq) yr_' represent 

varying adoption rates as a function of C pricing (US$20, US$50 and 
US$100 per Mg CO,(eq)), to a maximum technical potential—that is, 

the full implementation of practices on the available land base. Multiple 
practices are aggregated for cropland (for example, improved crop 
rotations and nutrient management, reduced tillage) and grazing land 

(for example, grazing management, nutrient and fire management, species 
introduction) categories. Practices that increase net soil C stocks or reduce 
emissions of N,O and CH, are combined in each practice category. 


soils under anaerobic conditions”’. As such, wetlands (177-284 Tg yr‘) 
and rice cultivation (33-40 Tgyr')° represent the largest soil-mediated 
sources of CHy globally. In contrast, well aerated soils act as sinks for 
CH, (estimated at ~30 Tg yr~') from the atmosphere via CH, oxidation, 
the bulk of this net sink being in unmanaged upland and forest soils**. 

Key determinants of soil CH, fluxes include aeration, substrate 
availability, temperature and N inputs>’; therefore, soil management 
can radically alter CH, fluxes. For example, in most soils, conversion to 
agriculture severely restricts CHy oxidation, related to the suppression 
of methanotrophs by accelerated N cycling™. In flooded rice, alterations 
in drainage regimes and organic residue incorporation could reduce 
emissions by ~25% or 7.6 Tg CH, yr! globally’, although cycles of 
wetting and drying of soils may also enhance N2O production®! and 
soil C mineralization®, thereby reducing the net mitigation effect. 

With global rice production projected to expand by ~40% between 
2000 and 2023 (ref. 63), the potential for further GHG mitigation via soil 
management appears to be large, although the global distribution and 
diverse nature of rice production systems—including irrigated, rain-fed 
and deep-water—present challenges to developing effective mitigation 
strategies. For longer-term (>20 year) projections, climate change and 
land-atmosphere interactions become increasingly important, with 
changes in N inputs, temperature, precipitation and atmospheric CO, 
concentration all likely to affect net CH, fluxes from soils™. 

This uncertainty highlights important gaps in understanding key 
processes and their underlying controls. The restoration of soil CH4 
uptake following agricultural conversion, for example, appears related 
to methanotroph community diversity®, about which we know too 
little. Likewise the abatement of CH, generation in rice rhizospheres 
is related to C compounds exuded by roots, such that CH, mitiga- 
tion might be achieved through further rice breeding and genetics™. 
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The portion of projected mitigation from soil C stock increase (about 90% 
of the total technical potential) would have a limited time span of 20-30 
years, whereas non-CO, emission reduction could, in principle, continue 
indefinitely'®. Estimates for biochar application®’ represent a technical 
potential only, but it is based on a full life-cycle analysis applicable over a 
100-year time span. Although global estimates of the potential impact of 
enhanced root phenotypes for crops have not been published, a first-order 
estimate of about 1 Pg CO (eq) yr! is shown, using the global average C 
accrual rates (0.23 MgC ha 'yr~') for cover crops”’, applied to 50% of the 
cropland land area used by ref. 19. ‘Setaside’ land is arable land, usually for 
annual crops, that is taken out of production and converted to perennial 
vegetation (often grassland) and not actively managed for agricultural 
production, such as conservation reserves. 


Limited availability of field-scale CH, flux data means a greater reli- 
ance on regionally averaged emission factors and extrapolation from 
mesocosm and laboratory incubations!%, and thus less site and condi- 
tion specificity in modelling fluxes. Importantly, establishing the net 
climate forcing effects of any intervention is a prime target for future 
soil management research. 


Global potential for soil GHG mitigation 

How important, in total, is this large, varied set of land-use and man- 
agement practices as a GHG mitigation strategy? One of the challenges 
in answering this question is to distinguish between what is technically 
feasible and what might be achieved given economic, social and policy 
constraints. A comprehensive global analysis of agricultural practices'® 
combined climate-stratified modelling of emission reductions and soil 
C sequestration with economic and land-use change models to estimate 
mitigation potential as a function of varying ‘C prices’ (reflecting 
a social incentive to pay for mitigation). They estimated total soil 
GHG mitigation potential ranging from 5.3 Pg CO2(eq) yr_* (without 
economic constraints) to 1.5 Pg CO2(eq) yr at the lowest specified 
C price (US$20 per Mg of CO,(eq)). Average rates for the majority of 
management interventions are modest, <1 Mg CO,(eq) ha~! yr~!. Thus, 
achieving large global GHG reductions requires a substantial proportion 
of the agricultural land base (Fig. 2). Although the economic and man- 
agement constraints on biochar additions (not assessed by ref. 19) are 
less well known, ref. 67 estimated a global technical potential of 1-1.8 Pg 
CO (eq) yr! (Fig. 2). 

A more unconventional intervention that has been proposed is the 
development of crops with larger, deeper root systems, hence increasing 
plant C inputs and soil C sinks?***. Increasing root biomass and select- 
ing for root architectures that store more C in soils has not previously 
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been an objective for crop breeders, although most crops have sufficient 
genetic plasticity to alter root characteristics substantially” and selection 
aimed at improved root adaptation to soil acidity, hypoxia and nutrient 
limitations could yield greater root C inputs as well as increased crop 
yields®*. Greater root C input is well recognized as a main reason for 
the higher soil C stocks maintained under perennial grasses compared 
to annual crops’*. Although there are no published estimates of the 
global C sink potential for ‘root enhancement of annual crop species, 
as a first-order estimate, a sustained increase in root C inputs might 
add ~1PgCO,(eq) yr or more if applied over a large portion of global 
cropland area (Fig. 2). 

Thus, the overall mitigation potential of existing (and potential future) 
soil management practices could be as high as ~8 Pg CO2(eq) yr_*. How 
much is achievable will depend heavily on the effectiveness of imple- 
mentation strategies and socioeconomic and policy constraints. A key 
strength is that a variety of practices can often be implemented on the 
same land area, to leverage synergies, while avoiding offsetting effects 
for different gases (Fig. 1). But regardless of which combination of man- 
agement interventions are pursued, effective policies, that incentivize 
land managers to adopt them, will be needed. A common thread across 
implementation strategies is the role for strong science-based metrics 
with which to measure and monitor performance. 


Implementation of mitigation practices 

Relative to many other GHG source categories, agricultural soil GHG 
mitigation presents particular challenges. Rates on an individual land 
parcel are often low, but vast areas of land are devoted to agriculture 
globally, and the implementers of mitigation practices—the people using 
the land—number in the billions. Therefore, engaging a substantial 
number of these people is a massive undertaking in itself. Furthermore, 
agricultural soil GHG emissions are challenging to quantify owing to 
their dispersed and variable nature and the multiplicity of controlling 
factors—operating across heterogeneous landscapes. Direct measure- 
ment of fluxes requires specialized personnel and equipment, normally 
limited to research environments, and hence not feasible for most mit- 
igation projects. Model-based methods, in which emission rates are 
quantified as a function of location, environmental conditions and 
management, provide a more feasible approach**”%”!, Process-based 
models, which dynamically simulate mechanisms and controls on fluxes 
as a function of climatic and soil variables and management practices, 
and empirical models based on statistical analysis of field-measured flux 
rates, represent differing but complementary approaches. In general, 
model-based quantification systems enable monitoring to focus on 
practice performance and thus dramatically reduce transaction costs 
for implementing mitigation policies”. 

Several implementation strategies for soil GHG mitigation exist 
(see Box 1), all of which require robust quantification and monitoring 
technologies. Those requiring the most rigorous methods involve offset 
projects participating in cap-and-trade markets, in which land manag- 
ers are directly compensated for achieving emission reductions. Other 
market-linked strategies, such as ‘green labelling’ systems for agricultural 
products, will also require rigorous yet easy-to-use GHG quantification 
tools, enabling agricultural producers to meet standards set by product 
distributors and accepted by consumers”. 

Within the voluntary C offset market space, there are a growing num- 
ber of projects that include soil GHG mitigation components’. Several 
large projects focus on preventing land conversion (that is, from forest 
and grassland), thus avoiding large CO2 emissions from soils and liq- 
uidated biomass C stocks. Relatively simple empirical models supple- 
mented with field measurements are commonly used for avoided land 
conversion projects. For more complex land-use projects, empirical 
models are less suited to capturing interactions across multiple emis- 
sion sources, and may over- or under-credit projects where a practice 
has an influence on multiple emission sources. There are relatively fewer 
projects targeting GHG mitigation on existing agricultural lands, involv- 
ing a broader suite of soil management practices, and early pilot-phase 
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BOX | 
Implementing soil GHG reductions 


Incentives for farmers to adopt alternative practices that 
mitigate GHGs can take a variety of forms, including: 

(1) Regulation and taxation. Direct regulatory measures intended 
to reduce soil GHGs at the farm scale are probably politically 
unfeasible and costly. Taxation of N fertilizer, already used in parts of 
the USA and Europe to reduce nitrate pollution, could function as an 
indirect tax that would reduce N2O emissions. 

(2) Subsidies. Targeted government payments or subsidies for 
implementing GHG-reducing practices is emerging as a policy 
alternative. For example, US Department of Agriculture programmes 
are including GHG mitigation as a conservation goal and provisions 
in the EU Common Agricultural Policy link subsidy payments to 
‘cross-compliance’ measures that include maintenance of soil 
organic matter stocks?%. A more direct link to soil GHG emissions 
follows from a recent decision to include cropland and grassland in 
EU commitments under the Kyoto Protocol!°°. 

(3) Supply-chain initiatives. Major food distributors are targeting 
sustainability metrics, including low GHG footprints, as a consumer 
marketing strategy (see refs 101 and 102 for examples of initial 
efforts involving agricultural producers and product distributors and 
retailers), setting performance standards for contracted agricultural 
producers, including the requirement of field-scale monitoring of 
production practices and quantification of GHG emissions. 

(4) Cap and trade. In a cap-and-trade system, emitters are 
subject to an overall emissions level or ‘cap’, in which permitted 
emissions decrease over time. Emitters can stay below the capped 
levels by reducing their own emissions or by purchasing surplus 
permits from capped entities that have exceeded their required 
reductions. Both compliance and voluntary markets can function 
as cap-and-trade systems!°?. Within many cap-and-trade systems, 
a limited amount of emission reductions (termed ‘offsets’) can 
be provided by non-capped entities. The inclusion of agricultural 
activities as offset providers has been growing, particularly within 
voluntary markets. To maintain the integrity of emission caps, key 
criteria for offset providers include demonstrating additionality 
(that is, ensuring that reductions result from project interventions 
and not simply business-as-usual trends), avoiding leakage (that 
is, unintended emission increases elsewhere as a consequence of 
the project activities), and providing for permanence (meaning that 
increased soil C storage, credited as a COz2 removal, is maintained 
long-term). 


NO and CH, reduction projects are only now being developed>™. Here, 
accurately quantifying C sequestration and/or emission reductions is 
more challenging owing to lower rates of change relative to baseline 
conditions, thus requiring more sophisticated models and supporting 
research infrastructure (Fig. 3). 

Another challenge for projects on existing agricultural lands is obtain- 
ing and processing the management activity data. For example, the 
Kenya Agriculture Carbon Project (KACP) involves a total of 60,000 
individual small-holder farmers’’. In contrast to projects involving 
major land-cover changes, where remote sensing can provide much of 
the activity monitoring (for example, retention of forested land over 
time), remote-sensing options are poorly suited for monitoring crop 
type, fertilizer, residue and water management, and organic matter 
amendments”4; for such practices the best sources of information are 
the land managers themselves (Fig. 3). 

Thus another option is to engage land managers as informa- 
tion providers. Examples of this approach are the Cool Farm Tool 
(http://www.coolfarmtool.org/)””, being used by farmers participat- 
ing in low C supply chain management, and the COMET-Farm tool 
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Figure 3 | Expanding the role of agricultural soil GHG mitigation will 
require an integrated research support and implementation platform. 
Targeted basic research on soil processes, expanding measurement and 
monitoring networks, and further developing global geospatial soils 

data can improve predictive models and reduce uncertainties. Ongoing 
advances in information technology and complex system and “Big Data’ 
integration offer the potential to engage a broad-range of stakeholders, 
including land managers, to ‘crowd-source’ local knowledge of agricultural 


(http://cometfarm.nrel.colostate.edu), which allows farmers to compute 
full farm-scale GHG budgets, for support of government-sponsored 
conservation initiatives and participation in mitigation projects”. Both 
tools provide web-based interfaces designed for non-specialists to enter 
land management information; Cool Farm utilizes empirical emission 
factor-type models, while COMET-Farm incorporates both empirical 
and process-based models. Such systems can be used to integrate local 
knowledge on management practices with detailed soil and climate maps, 
remote sensing and sophisticated models for emission calculations. Soon 
much of this functionality could be deployed in mobile applications 
(Fig. 3), which would be particularly advantageous in developing coun- 
tries where the existing infrastructure to collect and manage land-use 
data is weak’®. 


Quantifying uncertainties 
Inventories of soil C stock changes and net GHG fluxes using 
process-based models will always have uncertainty due to lack of pro- 
cess understanding, inadequate parameterization, and limitations asso- 
ciated with model inputs’” (such as weather, management and soils data). 
Empirical models generally rely on statistical analyses of measurement 
data to produce emission factors, along with an estimated uncertainty". 
However, empirical models can be biased if measurements do not fully 
reflect the conditions for the agroecosystems in the project. Even with 
the limitations in process-based understanding, process-based models 
are likely to provide the most robust framework for estimating soil C 
stock and GHG flux changes in climate-smart agriculture programmes”®. 
Monitoring, reporting and verification systems are a key element in 
a climate-smart agricultural programme. Such systems place different 
levels of importance on uncertainty depending on program type (see 
Box 1)”, but discounting payments on the basis of the level of uncer- 
tainty is likely to be part of programmes with financial incentives, such 
as cap-and-trade. Discounting encourages monitoring efforts to reduce 
uncertainty over time’®. If discounting payments for C sequestration 
and emission reduction practices with larger uncertainty is adopted in 
climate-smart agricultural programmes, then more advanced methods 
with process-based models will probably emerge as the preferred method 
because they will tend to have less uncertainty. For example, uncertainty 
was reduced by 24% when predicting national-scale C stock changes in 
the United States with process-based models compared to empirically 
derived factors”’. 
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management practices through web-based computer and mobile apps, and 
help drive advanced model-based GHG metrics. This will facilitate the 
implementation of climate-smart soil management policies, via cap-and- 
trade systems, product supply-chain initiatives for ‘low-carbon’ consumer 
products, and national and international GHG mitigation policies; it will 
also promote more sustainable and climate-resilient agricultural systems, 
globally. 


Another consideration is that uncertainties in estimating C stock and 
GHG emissions with process-based models are considerably larger for 
reporting by single individuals, particularly if the amount of change on 
an individual farm is small’”. Aggregation of many farms into larger pro- 
jects will reduce uncertainties, which could be a viable approach for man- 
aging uncertainty and reducing the discounting of incentive payments. 

Verification is an independent evaluation of estimated emissions 
intended to provide confidence that the reported results are correct, but 
in practice, the requirements for verification are highly variable across 
different GHG mitigation efforts, ranging from essentially no require- 
ments to annual evaluations”. Verification typically focuses on the 
accuracy of the estimates, and possibly the most stringent approach is 
an independent set of measurements. Although independent data may 
be less favoured in terms of costs relative to alternatives, such as expert 
judgement”, soil monitoring networks deployed at national or regional 
scales could produce independent data for evaluating model-based 
assessments of soil C stock changes and GHG emissions® and for model 
bias adjustment, using empirically based methods*!. 

Another approach to verification is to use atmospheric observations 
of trace-gas concentrations and inverse modelling to estimate fluxes 
between the atmosphere and land surface®**?. This ‘top-down’ model- 
ling, using a network of tower-based observations of CO, concentrations, 
was used to verify ‘bottom-up’ inventory modelling based on observed 
management activities in the largely agricultural region of the central 
USA*#+85, Since atmospheric observations integrate all CO) fluxes in 
the region, the inventory included a full assessment of all sources and 
sinks. However, even with the fully integrated CO, flux, it is possible to 
statistically disaggregate individual sources as part of the analysis, such 
as contributions from soil C pools to the regional flux®. Satellite-based 
measurements are providing a new source of atmospheric trace-gas data 
that can be used to estimate land surface fluxes with inverse model- 
ling frameworks*”**, Although atmospheric observations and satellite 
imagery may become a standard for verifying regional inventories in the 
future, these methods need further testing before operational systems 
can be reliably deployed. 


Conclusions and recommendations 

Climate change and GHG mitigation require an ‘all of the above’ 
approach®®, where all reduction measures that are feasible, cost-effective 
and environmentally sustainable should be pursued. For soils, a variety 
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of management practices and technologies are known to reduce emis- 
sions and promote C sequestration, most of which also provide envi- 
ronmental co-benefits. The impediment to implementing agricultural 
soil GHG mitigation strategies more aggressively to date is primarily the 
feasibility of quantifying and verifying soil mitigation activities” in a 
cost-effective manner. Overcoming this barrier therefore translates into: 
(1) increasing the acceptance of soil management within compliance and 
voluntary C markets; (2) reducing costs to governments for providing 
environment-based subsidies; and (3) meeting the demands of consumers 
for ‘low carbor products. 

Reducing and managing uncertainties are key to both improving 
predictive models and decision-support tools and the design of effective 
policies that promote soil-based GHG mitigation. To advance these 
efforts, several research and development priorities are apparent (Fig. 3). 
First, support for research site networks of soil flux (N.O, CH,4) and soil 
C measurements encompassing a wide variation in management, as well 
as ‘on-farm soil C monitoring networks® needs to be strengthened (see 
ref. 91 for an example of a large-scale research network measuring 
soil GHG flux in a geographically distributed field experiment, using 
uniform protocols, advanced instrumentation and data portals in 
Europe, and ref. 92 for an example in the USA). Such support should 
coordinate with basic research (for example, on soil organic matter 
stabilization processes, N.O and CH, microbiology, plant-microbe 
interactions, plant breeding and root phenotyping) to advance pro- 
cess understanding, develop new mitigation practices and fill gaps in 
underrepresented soil- and climate-management systems. High-quality 
data generated from consistent measurement protocols is critical for 
evaluating and improving models. These efforts may benefit from 
development of new sensor technologies, enabling cheaper and quicker 
soil measurements”. Although multiple competing models are needed, 
both to spur innovation and because no single model will be best in 
all situations, model development will benefit from greater collabo- 
ration and cross-model testing among developers, moving towards a 
more open-source, community-development approach”. Large geo- 
spatial databases of soil biophysical properties and climate variables 
are critical to quantify soil processes accurately across the landscape 
(Fig. 3). High-resolution soil maps exist in most developed countries 
(and increasingly in developing countries”), and if made publicly 
available (there is free access to fine-spatial-scale (about 1:15,000 to 
1:20,000) soil maps for the USA®’), would greatly improve capabilities 
for modelling GHG emissions at scale. 

Finally, realizing the potential for climate change mitigation through 
global soil management requires understanding cultural, political and 
socioeconomic contexts, and the ways in which widespread, sustained 
changes in practice can be successfully achieved within it?”°*. As such, 
there needs to bea greater level of engagement with the land users them- 
selves, who will be the ones implementing practices that abate GHG 
emissions and sequester C. Engagement means both education and out- 
reach, highlighting the links between agriculture and GHGs and using 
innovative strategies” (Fig. 3) to involve stakeholders in gathering and 
using their local knowledge of how the land is being used now and how 
it might best be used in the future, thus establishing a new paradigm for 
climate-smart soil management. 
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Thalamic reticular impairment underlies 


attention deficit in Ptchd1*'~ mice 


Michael F. Wells!2*, Ralf D. Wimmer***, L. lan Schmitt*+, Guoping Feng? & Michael M. Halassa*+-%7 


Developmental disabilities, including attention-deficit hyperactivity disorder (ADHD), intellectual disability (ID), and 
autism spectrum disorders (ASD), affect one in six children in the USA. Recently, gene mutations in patched domain 
containing 1 (PTCHD1) have been found in ~1% of patients with ID and ASD. Individuals with PTCHD] deletion show 
symptoms of ADHD, sleep disruption, hypotonia, aggression, ASD, and ID. Although PTCHD1 is probably critical for 
normal development, the connection between its deletion and the ensuing behavioural defects is poorly understood. 
Here we report that during early post-natal development, mouse Ptchd1 is selectively expressed in the thalamic reticular 
nucleus (TRN), a group of GABAergic neurons that regulate thalamocortical transmission, sleep rhythms, and attention. 
Ptchd1 deletion attenuates TRN activity through mechanisms involving small conductance calcium-dependent potassium 
currents (SK). TRN-restricted deletion of Ptchd1 leads to attention deficits and hyperactivity, both of which are rescued 
by pharmacological augmentation of SK channel activity. Global Ptchd1 deletion recapitulates learning impairment, 
hyper-aggression, and motor defects, all of which are insensitive to SK pharmacological targeting and not found in the 
TRN-restricted deletion mouse. This study maps clinically relevant behavioural phenotypes onto TRN dysfunction ina 


human disease model, while also identifying molecular and circuit targets for intervention. 


Recent genetic studies have revealed substantial overlap of risk genes 
across seemingly distinct neurodevelopmental and psychiatric disor- 
ders including ASD, ADHD, schizophrenia, and ID! +. Such shared 
genetic architectures could potentially explain the overlap of behav- 
ioural abnormalities across these diagnostic categories, but because of 
the difficulty in mapping circuitry mechanisms of behaviour, under- 
standing how diverse genetic lesions converge onto behaviour-relevant 
circuit dysfunction has been limited. 

Here we focused on PTCHD1, a gene that is mutated in about 1% 
of all patients with ASD and ID*-!°. Comprehensive clinical analysis 
of PTCHD1 deletion patients identified a variable non-syndromic 
neurodevelopmental disorder with symptoms ranging from atten- 
tion deficit, hyperactivity, sleep abnormality, hypotonia, and learning 
disability'’. We found that Ptchd1 was selectively expressed in the 
TRN of mice in early development and continued to be enriched in 
this structure throughout adult life. The TRN is critical for thalam- 
ocortical transmission!2~!5, generation of sleep rhythms!*16"18, 
sensorimotor processing’””®, and attention!*7!, and its perturbation 
could result in deficits in these domains. By generating a conditional 
Ptchd1-knockout mouse, we mapped ADHD-like behaviours onto 
TRN circuit dysfunction via two independent methods. First, by 
deleting Ptchd1 selectively from the TRN, we replicated the attention 
deficit and hyperactivity behaviours, but not other disease-related 
phenotypes found in the full knockout. Second, pharmacological 
rescue of TRN biophysical dysfunction selectively rescued these 
ADHD-related behaviours in the Ptchd1 knockout. These findings 
constitute the first evidence for a ‘leaky thalamus’ in a neurodevel- 
opmental disorder, where irrelevant inputs that are normally sup- 
pressed become highly distracting. Most importantly, we identified 
the TRN and its SK channels as circuit and molecular targets for 
intervention. 


Altered TRN neuronal biophysics 

The X-linked Ptchd1 gene is predicted to encode a twelve-pass trans- 
membrane protein with a sterol-sensing domain®, prompting its 
classification as a member of the Patched family and speculation that 
it may function as a Sonic hedgehog receptor”. Prenatally, Ptchd1 
expression is found in the developing cerebellum and diencephalon 
(http://www.ncbi.nlm.nih.gov/nuccore/NM_001093750). We found 
Ptchd1 mRNA to be confined to the TRN at birth, and by post-natal 
day 15 (P15) onwards, to be expressed in the striatum, cortex, and 
cerebellum (Fig. 1a, b, Extended Data Fig. 1, Supplementary Table 1). 
Interestingly, Patched family members Ptchd2 and Ptchd3 show no 
TRN expression (http://www.brain-map.org/, 77620810 and 71891731, 
respectively). Thus, Ptchd1 may play a unique role in the TRN. 

To understand how Ptchd1 deletion might contribute to neurode- 
velopmental disorders, we generated a conditional allele of Ptchd1 by 
targeting exon 2 (Extended Data Fig. 2a). This exon encodes 3 out 
of the 12 transmembrane domains, including a substantial portion of 
the sterol-sensing domain. Loss of this exon is predicted to generate a 
prematurely terminated non-functional protein (Extended Data Fig. 2b). 
In situ hybridization, genotype PCR, and cDNA transcript analyses 
confirmed successful excision of exon 2 (Extended Data Fig. 2c-e; for 
source data, see Supplementary Fig. 1). Because Ptchd1 is X-linked 
and individuals with PTCHD1 deletion are almost exclusively males, 
we used hemizygous male mice (Ptchd1*'~; referred to as Ptchd1- 
knockout) for this study. Ptchd1-knockout mice were viable with normal 
body weight allowing for direct genetic modelling of this human 
condition. 

Given the enriched expression of Ptchd1 in the TRN, we focused our 
initial investigation on this structure in Ptchd1-knockout mice. The 
TRNisa group of GABAergic neurons that provide the major source of 
inhibition to thalamic relay nuclei, and are thought to regulate cortical 
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Figure 1 | Impaired repetitive bursting and SK2 
currents in TRN neurons from knockout mice. 
a, b, Ptchd1 expression (n = 3 wild-type (WT) 
mice). Scale bar, 1 mm. c, Representative TRN 
burst traces (n= 8 WT, 9 KO cells). d, Reduced 
burst firing in knockout (KO) TRN neurons 
(n=8 WT, 9 KO cells). Vin rest, resting membrane 
potential; R;, input resistance. e, Representative 

T and SK2 current traces. f-h, Normal T (g) and 
reduced SK2 currents (h) in knockout cells (n= 8 
WT, 9 KO). i, Diminished free [Ca”*], in knockout 
cells. Representative heat maps show background- 
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rhythms, sleep, and attention*!°”®. To begin investigating possible 
physiological changes in the knockout mice, we exploited a well-known 
characteristic of TRN neurons. Depending on their resting membrane 
potential, these neurons fire in two distinct modes upon receiving syn- 
aptic input”’. At depolarized membrane potential, they fire tonic Na* 
spikes. When hyperpolarized, they generate repetitive ‘low-threshold’ 
Ca’* transients crowned by high-frequency Na* spikes known as 
bursts*®8-°. Whole-cell patch-clamp recordings of TRN neurons 
revealed a significant decrease in repetitive bursting in knockout mice 
compared to wild-type controls (Fig. 1c, d). These changes were not 
the result of altered knockout TRN neuron resting membrane poten- 
tial, input resistance, or escape from hyperpolarization (Fig. 1d, inset). 

Because repetitive bursting is known to depend on interactions 
between T-type Ca?* and small conductance calcium-activated Kt 
channels”®, we asked which of these two conductances were primarily 
impaired in the knockout mice. Under voltage clamp, we found that 
T currents were intact but SK currents were reduced by 50% in the 
knockout (Fig. le-h). As SK channels are sensitive to resting state intra- 
cellular Ca?* ([Ca?*];)?>?!, we measured this concentration using the 
ratiometric Ca** indicator Fura-2AM in TRN neurons from acute 
brain slices. We found a twofold reduction of [Ca?*]; in knockout 


TRN neurons (Fig. 1i), suggesting that altered Ca”* homeostasis may 
underlie SK channel deficits. 


Reduced TRN -generated sleep spindles 

Previous studies have suggested that TRN bursting plays a role in 
the generation of sleep spindles!®”®, predicting that the diminished 
bursting found in Ptchd1-knockout mice would lead to reduced sleep 
spindles. Using independently adjustable multi-electrode arrays to 
directly target TRN neurons for electrophysiological recordings in freely 
behaving animals and surface electroencephalography (Fig. 2a, Extended 
Data Fig. 3a)*?, we discovered that TRN neurons from knockout 
mice exhibited reduced burst firing in sleep (Fig. 2b) and knockout 
mice showed an overall reduction in sleep spindle count (Fig. 2c, 
Extended Data Fig. 3b, c). Further, the degree of TRN neuronal engage- 
ment in spindle events was substantially diminished in the knockout 
(Extended Data Fig. 3d-f), supporting the link between the cellular 
and network phenotypes in this disorder and perhaps other human 
neurodevelopmental disorders****, Importantly, consistent with the 
notion that sleep spindles are a marker for sleep stability***°, we found 
Ptchd1-knockout mice to display highly fragmented sleep (Fig. 2d-f, 
Supplementary Table 2). 
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Impaired sensory-evoked thalamic inhibition 

In addition to reduced rebound bursting, insufficient K*-mediated 
hyperpolarization is expected to more generally alter TRN neuronal 
excitability. Most importantly, it could lead to neurons not being 
hyperpolarized enough for T-type Ca** channels to de-inactivate 
and boost excitability**. Therefore, despite finding T-type currents 
to be intact under controlled voltage-clamp conditions (at —70 mV), 
insufficient hyperpolarization caused by reduced SK currents 
would render T-type channels less recruitable under physiological 
conditions'’, leading to reduced TRN neuronal activity and dimin- 
ished overall thalamic inhibition. To test this prediction at the popu- 
lation level and in the intact brain, we used chloride photometry, a tool 
that we recently developed as a proxy for population-level GABAergic 
inhibition'’. This technique utilizes fluorescence resonance energy 
transfer (FRET)-based measurements of the chloride-sensor 
SuperClomeleon*’, a reporter composed of a cyan fluorescent protein 
(CFP) FRET-donor and a chloride-quenchable yellow fluorescent 
protein (YFP) FRET acceptor (Fig. 3a). By introducing SuperClomeleon 
into visual thalamic neurons (lateral geniculate nucleus, LGN; Fig. 3b), 
we observed visual-evoked chloride transients (Fig. 3c, d), replicating 
our recent findings'?. Visual-evoked inhibitory transients were 
observed in both wild-type and Ptchd1-knockout LGN, but a quan- 
titative comparison revealed a 25% reduction of these transients in 
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the knockout (Fig. 3e—g). Impaired inhibition was also observed in 
response to trains of stimuli (Fig. 3h, j). Interestingly, a small inhibitory 
augmentation as a result of repeated stimulation was observed in 
wild-type mice and this augmentation was also significantly reduced 
in the knockouts (Fig. 3h, i). Altogether, these findings provide direct 
evidence for reduced thalamic inhibition in Ptchd1-knockout mice 
and are consistent with impaired TRN output with ensuing deficits in 
sensory-related thalamic inhibition. 


ADHD-like behaviours in Ptchd1-knockout mice 

We have previously observed that thalamic inhibition is used to sup- 
press unwanted sensory inputs during attention’. The observed 
reduction in thalamic inhibition (Fig. 3) would predict that unwanted 
sensory inputs may become particularly distracting for Ptchd1- 
knockout mice. To test this hypothesis, we trained mice on a visual 
detection task shown to require attentional engagement’? (Fig. 4a). 
Mice initiated each trial by continuously breaking an infrared barrier 
for 0.5-0.7s, ensuring proper head position when a visual stimulus was 
presented either to the right or left of the animal. Correct indication 
of visual stimulus location by nose-poking resulted in reward deliv- 
ery. Under such conditions, Ptchd1-knockout mice performance was 
comparable to that of a wild type (Fig. 4b). However, in the presence of 
a visual distractor during anticipation, knockout mice showed impaired 


Figure 3 | Reduced sensory-evoked thalamic 
inhibition in knockout mice. a, Schematic 

of CFP to YFP FRET. b, Confocal images of 
SuperClomeleon expression in LGN. ¢, Stimuli 
delivered to the eye contralateral to implanted 
LGN. d, Example traces of visual-evoked CFP and 
° YFP fluorescence changes. e-g, Reduced LGN 
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performance (Fig. 4c). The specific distractibility phenotype, rather 
than a more general failure of attentional engagement, revealed by this 
novel behavioural task is consistent with the prediction of impaired 
thalamic inhibition required for distractor suppression. It is also 
consistent with clinical findings in patients with related neurodevel- 
opmental disorders**. This impairment was not the result of general 
sensorimotor dysfunction, given the intact performance on standard 
sensorimotor testing (Extended Data Fig. 4a—c). 

Distractibility is often accompanied by hyperactivity in several 
human neurodevelopmental disorders such as ADHD*°, ADHD 
symptoms are frequently observed in patients with PTCHD1 muta- 
tions'!. Interestingly, Ptchd1-knockout mice showed a hyperactivity 
phenotype in the open field (Fig. 4d). Classical ADHD-related hyper- 
activity is predicted to be treated effectively with amphetamines, as 
has been previously described in other mouse models of neurodevel- 
opmental disease*’. Surprisingly, Ptchd1-knockout hyperactivity was 
insensitive to amphetamine treatment (Fig. 4e, f), suggesting a unique 
pathophysiological origin that may be related to the approximately 30% 
of ADHD patients who do not respond to amphetamines”. 

In addition to attention deficits and hyperactivity, Ptchd1-knockout 
mice showed a variety of behavioural abnormalities. Although knock- 
out mice showed intact performance on tasks requiring simple spatial 
learning (Extended Data Fig. 5a—d), they exhibited significant deficits 
on tasks necessitating more complex associations that are believed 
to require integration across multiple brain structures’. Knockout 
mice showed fear-induced freezing deficits in contextual (Fig. 4g) and 
cued (Fig. 4h) fear-conditioning tests. Impaired learning was corrob- 
orated by profound deficits on the inhibitory avoidance task (Fig. 4i). 
Knockout mice also exhibited motor defects such as gait abnormalities 
and hypotonia, as well as hyper-aggression (Extended Data Fig. 5e-h). 
All of these behaviours were independent of genetic background 
(Extended Data Fig. 6). These behaviours are consistent with clinical 
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findings of multi-system abnormalities in PTCHD1 deletion patients, 
suggesting the major function of PTCHD1 is evolutionarily conserved. 
Interestingly, although several PTCHD1 deletion patients have been 
diagnosed with ASD, knockout mice did not exhibit differences in 
repetitive grooming (Extended Data Fig. 7a) or social interaction 
(Extended Data Fig. 7b, c). This may reflect evolutionary divergence 
either in PTCHD1 function or in behavioural circuits related to 
PTCHDI deficiency”. 


TRN defects underlie ADHD-like behaviours 

To determine which behavioural abnormalities are caused by TRN dys- 
function, we sought strategies to largely limit Ptchd1 deletion to the 
TRN. To choose the appropriate Cre-driver line to breed with floxed 
Ptchd1 (Ptchd1*"") mice, we looked for specific TRN overlap between 
candidate markers and PTCHD1 expression. Immunohistochemical 
co-labelling experiments of Ptchd1- YFP*'-, a novel knock-in mouse 
with yellow fluorescent protein (YFP) in place of Ptchd1 exon 1 
(Extended Data Fig. 8a), revealed significant and unique overlap 
between YFP and the inhibitory neuronal marker GAD67 in the TRN 
(Extended Data Fig. 8b, Supplementary Table 3). 

Both parvalbumin and somatostatin are inhibitory neural markers 
that are widely expressed across TRN neurons*** and showed overlap 
with YFP limited to this region (Extended Data Fig. 8c), suggesting that 
mating Cre-drivers of either parvalbumin or somatostatin to Ptchd1*// 
mice would result in a Ptchd1 deletion that is primarily confined to the 
TRN. Given the early post-natal expression of PTCHD1 and somato- 
statin in the TRN (Extended Data Fig. 9), we reasoned that knock-in 
mice expressing Cre recombinase in somatostatin neurons without 
disrupting endogenous Sst expression (Sst-Cre) would be a more suit- 
able choice for such experiments*”. We crossed Ptchd1*! female mice 
to Sst-Cre mice and generated male mice lacking Ptchd1 in the TRN 
(Sst-Cre* Ptchd1*"), as confirmed by in situ hybridization (Fig. 5a). 
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Figure 5 | TRN dysfunction explains ADHD-like 
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Sst-Cre+ Ptchd1*" mice displayed attention deficits (Fig. 5b) and 
recapitulated the hyperactivity phenotype observed in the germline 
knockout (Fig. 5c, d), suggesting that these behaviours are explained 
by TRN dysfunction. Sst-Cre+ Ptchd1*!" mice showed intact learning 
(Fig. 5e) and did not show hypotonia or hyper-aggression (Extended 
Data Fig. 10a, b), reaffirming the circuit specificity of this genetic 
model. Interestingly, Sst-Cre* Ptchd1*'" mice exhibited fragmented 
sleep (Extended Data Fig. 10c-e), confirming the notion that sleep 
abnormalities observed in the germline knockout are of TRN origin, 
and that sleep and attention deficits can arise from common circuit 
dysfunction®. 

Finally, we asked whether pharmacological boosting of SK chan- 
nels could rescue ADHD-like knockout behaviours. Acute injection 
of the SK positive allosteric modulator 1-ethyl-benzimidazolinone 
(1-EBIO) significantly mitigated impaired sensory-evoked thalamic 
inhibition in germline knockout mice with no impact on inhibitory 
transients in the wild types (Fig. 6a). Consistent with these physiolog- 
ical effects, 1-EBIO did not affect attentional task performance in the 
wild-type mice (Fig. 6b), but substantially mitigated distractibility in 
the knockout mice (Fig. 6c). In addition, 1-EBIO injection rescued the 
hyperactivity phenotype in the knockouts (Fig. 6d, e). The specificity 
of this pharmacological approach was supported by its lack of effect 
on other abnormalities including hypotonia (Extended Data Fig. 10f), 


hyper-aggression (Extended Data Fig. 10g), and learning deficits (Fig. 6f). 
Together, these results further support SK channel dysfunction as a 
cellular mechanism for these behavioural abnormalities. Future studies 
examining the therapeutic benefit of SK targeting for sleep fragmen- 
tation and instability and its potential relevance to inattention would 
be important. 


Discussion 
To our knowledge, this study is the first to show that a TRN circuit 
deficit is central to a specific set of behavioural impairments in a human 
neurodevelopmental disease model. Using conditional knockout of 
Ptchd1, a gene with expression restricted to the TRN during early 
post-natal development, we mapped behavioural phenotypes onto 
their circuit substrates. We additionally discovered that modulation of 
SK channel function could be explored as a potential novel treatment 
strategy for PTCHD1 deletion patients with attention deficits and 
hyperactivity. Hyperactivity of TRN origin may be the motor equiv- 
alent of sensory distractibility, which could involve dysfunctional 
motor TRN-thalamic circuits. Future experiments exploring inhibitory 
control of motor thalamus will formally test this conjecture. 

Although basic studies have shown the TRN to be central for atten- 
tion?” and sleep spindles’®, our study directly shows how disease- 
relevant impaired TRN output can result in attention deficits, 


Figure 6 | SK conductance augmentation corrects 
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hyperactivity, and sleep disruption. This direct demonstration was made 
possible by developing SuperClomeleon photometry, a technique that 
can now be widely applied to disease models as a screen for impaired 
thalamic inhibition. It is possible that a ‘leaky thalamus’ caused by 
impaired TRN function underlies attention deficits, hyperactivity, and 
sleep disruption across various neurodevelopmental disorders, and we 
expect the set of genetic, physiologic, and behavioural approaches we 
introduce here to facilitate such discoveries. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Ptchd1 conditional knockout mice. Ptchd1 conditional knockout mice were gen- 
erated by homologous recombination in R1 embryonic stem cells and implanted in 
C57BI/6] blastocysts using standard procedures. The targeting vector was designed 
to flank exon 2 of the Ptchd1 gene with loxP sites and a neomycin (Neo) selec- 
tion cassette. Chimaeric mice were crossed to C57Bl/6J females (Jackson Labs). 
Germline transmission was determined by genotyping PCR of mouse tail DNA, 
using primer pFW PtlcKO Gen la (5’-GGATGGTACCACCTACAATATGC-3’) 
and pRV PtlcKO Gen 3b (5’/-AAGCCAAAGAGTTTACCCTG-3’) for the wild- 
type allele (187 base pairs (bp)) and the floxed allele (227 bp). The F1 hybrids were 
crossed to C57BI/6) B-actin Flp mice to excise the Neo cassette. The floxed mice 
were then backcrossed to C57B1/6] mice for five generations. After the fifth gener- 
ation, speed congenic genotyping PCRs were conducted to determine the approx- 
imate purity of the background. Only mice showing >95% C57Bl/6] background 
were used for subsequent matings. Backcrossed Ptchd1-knockout mice were then 
bred with C57BI/6] 3-actin Cre mice to produce germline knockouts of the floxed 
allele. Genotypes were determined by PCRs using the pFW PtlcKO Gen 1a and 
pRV PtlcKO Gen 4c (5’-GGACTTTGGAGTAAACCACC-3’; in the Neo cassette) 
primers for the knockout allele (351 bp). For all behavioural experiments in the 
C57Bl/6) background, Ptchd1 Y/~ (knockout) and Ptchd1*'+ (wild type) males were 
bred with Pichd1‘’~ females. For the C57 and 129 mixed background behavioural 
experiments, 129 wild-type males were bred with Ptchd1*/~ C57Bl/6J females and 
the F1 offspring were used for experiments. Animals were housed 3-5 by genotype 
per cage at a constant 23°C in a 12h light-dark cycle (lights on at 07:00) with 
ad libitum food and water (unless otherwise noted). 

Healthy age-matched male mice between 2-5 months were used for all behav- 
ioural experiments. Sample sizes were chosen based on standards in the field as 
well as previous experience with phenotype comparisons. No statistical methods 
were used to predetermine sample size. All behaviour experiments employed a 
group counterbalancing strategy to ensure that both genotypes were being tested 
at equal frequencies throughout the testing day. For drug studies, mice were cho- 
sen at random to be included in the vehicle-treated versus drug-treated groups. 
Experimenters were blind to genotype, but not group (that is, group A versus group B), 
during data acquisition and analysis. D'Agostino and Pearson omnibus normality 
tests were run. When normality was violated, non-parametric tests (Wilcoxon 
rank-sum) were used. ANOVA was used to estimate group variance where mul- 
tiple comparisons were made. All data analysed with two-tailed t-tests showed 
similar variances between groups. Data presented as scatter plots with mean 
denoted by bar, unless otherwise noted. All animal experiments were conducted 
according to the NIH Health Guide for Care and Use of Laboratory Animals and 
were approved by the Massachusetts Institute of Technology Institutional Animal 
Care and Use Committee. 

Ptchd1-YFP mice. Ptchd1-YFP mice were generated by homologous recom- 
bination in Rl embryonic stem (ES) cells and implanted in C57Bl/6) blas- 
tocysts using standard procedures. The targeting vector replaced exon 1 of 
the mouse Ptchd1 gene with ATG-eYFP-STOP cassette and a Neo cassette. 
Correctly altered ES cell colonies were PCR screened using primers targeting 
the YFP insert and long-arm PCR (LA Taq) methods. Chimaeric mice were 
crossed to C57BI/6] females from Jackson Labs. Germline transmission was 
assessed through genotyping PCR of mouse tail DNA, using primers pFW 
Ptl-YFPki Gen 3a (5’‘-TTACTTCCTTTTCCCCACC-3’), pRV Ptl-YFPki Gen 
3a (5‘-CCCAGGTTCTCTAGTAGATTCC-3’), and pFW Ptl-YFPki Gen 3b 
(5'‘-GGAGAGTGAATCTAGGTGGAG-3’) for the wild-type allele (positive band, 
190 bp) and the YFP positive allele (positive band, 316 bp). The F1 hybrids were 
backcrossed to C57B1/6] mice for two generations. Female mice containing the YFP 
insertion (Ptchd1-YFP*’—) were used for all immunohistochemistry experiments. 
In situ hybridization. mRNA in situ hybridization was performed with 201m 
cryosections from freshly frozen PO, P15, and P35 brain tissue from male mice 
using a mixture of two digoxigenin (DIG)-labelled probes against mouse Ptchd1 
cDNA (GenBank Accession NM_001093750.1; ex2 base pairs 372-1006 and ex3 
base pairs 1290-2027), except for the wild type and knockout comparisons, in 
which only the ex2 probe was used. The hybridization signal was detected using an 
alkaline-phosphatase-conjugated anti-DIG antibody (Roche) and developed using 
5-bromo-4-cloro-indolylphosphate/nitroblue tetrazolium (Roche). Sections were 
imaged using an Olympus BX61 motorized fluorescent microscope. 

Immunohistochemistry. Mice were perfused with ice-cold 1x PBS and 4% par- 
aformaldehyde. Brains were then fixed overnight at 4°C before vibratome sec- 
tioning. We washed 50 1m sections with 1 x PBS for 3 x 5 min before a 1h room 
temperature incubation in blocking solution (5% normal goat serum, 2% BSA, 
0.2% Triton X-100 in 1 x PBS). Sections were then incubated overnight at 4°C in 
primary antibody solution: rabbit anti-GFP (Invitrogen A11122; 1:1000), chicken 
anti-GFP (Abcam 13970; 1:1000), mouse anti-GAD67 (Millipore 5406; 1:1000), 
rabbit anti-somatostatin (Peninsula Labs T4102; 1:1000), and mouse anti-NeuN 


(Millipore MAB377; 1:1000). Following the 3 x 20 min wash in 1 x PBS, tissue was 
incubated for 4h at room temperature in a second antibody solution: goat anti- 
rabbit Alexa 488 (LT A-11034; 1:1000), goat anti-rabbit Alexa 555 (LT A-21428; 
1:1000), goat anti-mouse Alexa 488 (LT A-11001; 1:1000), goat anti-mouse Alexa 
555 (LT A-21422; 1:1000), and goat anti-chicken Alexa 488 (LT A-11039; 1:1000). 
Sections were once again washed 3 x 20 min in 1 x PBS before mounting onto 
glass slides using Fluoro-Gel. Images were taken with an Olympus Fluoview 1000 
confocal microscope. YFP co-labelling was quantified using Image]. Images were 
converted to 8-bit files and automatically background substracted and thresholded. 
The resulting images were converted to a mask and then the processes ‘fill holes 
and ‘watershed’ were implemented. The processed images were then analysed using 
the ‘analyze particles feature with the squared size set to ‘200-Infinity’ and the show 
feature set to ‘outlines. This end product of this procedure was an image containing 
enumerated cell skeletons. Skeleton images from the green and red channels were 
merged and overlapping skeletons were counted as a co-labelling event. 

Slice electrophysiology. Freshly prepared brain slices from P21-P28 male mice 
were superfused with oxygenated ACSF (125nM NaCl, 25nM NaHCO3, 25nM 
glucose, 2.5nM KCl, 1.25nM NAH>PO,, 1.2nM MgCh, 2nM CaCh, 1.7nM 
ascorbic acid) and recorded at 30-34°C. Patch pipettes (2.5-4 MQ) contained the 
following intracellular solution in 140 nM KMeSO,, 10nMKCI, 10nM HEPES, 
0.1nM EGTA, 4nM Mg-ATP, 0.2nM Na-GTP, 10nM phosphocreatine, (285nM 
mOsm, pH 7.2). For rebound burst characterization, cells were held in current 
clamp at potentials ranging from —85 to —25 mV through constant current injec- 
tion. Rebound bursting was determined following a 600 ms, —0.5nA current step. 
Events crossing two standard deviations of the baseline noise were considered 
bursts. To determine T and SK currents, responses to different hyperpolarizing 
steps (500 ms, ranging from —110 to —60 mV) were recorded in voltage clamp 
configuration. T currents were isolated through application of the SK channel 
blocker apamin (100nM), whereas SK currents were estimated by digital subtrac- 
tion of the isolated T current from control currents without apamin in slices from 
4-5 mice per genotype. Cells where the input resistance changed >15% during 
the recording were excluded from the analysis. 

In vivo TRN recordings. Hyperdrives containing 12 individually adjustable 
microdrives loaded with 1-2 stereotrodes were built as previously described*”. 
Mice were anaesthetized with 1% isofluorane and mounted on a stereotaxic frame. 
A3 x 2.5mm craniotomy (centre coordinate from bregma, medio-lateral: 2.5 mm, 
anterior-posterior: —1.3 mm was drilled and the hyperdrive was implanted at a 
15° angle relative to midline. At time of implantation, stereotrodes were lowered 
<500 1m into the brain. Three stainless steel screws (one located prefrontal and 
two cerebellar) served as electroencephalography (EEG) electrodes and ground 
and anchored together with two additional fixation screws the hyperdrive to the 
skull. After recovery, mice were connected to a custom-made 64-channel pream- 
plifier headstage (Neuralynx) and data was acquired using a Neuralynx Digilynx 
recording system. Stereotrode signals were amplified, filtered between 0.1 Hz 
and 9kHz and digitized at approximately 30 kHz. Spikes were manually clustered 
using the MClust toolbox for MATLAB, and bursts were identified as at least two 
spikes with an inter-spike interval of <10 ms, which were preceeded by > 70 ms 
of silence. Spindle detection was performed as previously described”. The EEG 
signal was filtered within the spindle frequency band (9-15 Hz) and the Hilbert 
transform (MATLAB function ‘hilbert’) was computed. The envelope of the signal 
(1s smoothing) was used as a basis for spindle detection. A threshold of one standard 
deviation (s.d.) was applied and each threshold crossing, with parameters of >0.5s 
and <3s, were initially included. These events were subsequently visually inspected 
before being included in the analysis. To assess consistency in the phase locking 
of individual TRN neurons with spindles, we used spike phase synchrony analysis 
between unit firing and spindle band (9-15 Hz) filtered component of the local 
field potential (Extended Data Fig. 3d-f). For each TRN unit, we first constructed 
a spike-phase histogram of firing rates relative to alpha. Units were considered 
significantly phase-locked if the distribution of spike phases significantly differed 
from a uniform distribution on the basis of Rayleigh’s test for circular uniformity 
(cut-off, P< 0.05). To quantify the degree of phase-locking, the Pearson’s corre- 
lation coefficient between spike counts and phase angle was computed for each 
unit. Analysis was performed in MATLAB using the circular statistics toolbox. 
Sleep analysis. Sleep-related immobility was used to measure sleep bout duration 
in mice (C57, n=9 WT and 10 KO; Sst-Cre, n= 10 WT and 10 KO), a technique 
that has previously been shown to correlate (r= 0.94) with EEG recordings*’. We 
used BIOBSERVE Behavioural Sequencer boxes (BIOBSERVE) that utilize Piezo 
sensors and video recordings to automatically score 23 different movements/behav- 
iours, including immobility (Sequencer label ‘still’). The lighting inside the boxes 
was set to 5-10 lx, which mimicked home cage light intensity in the MIT animal 
holding facility. Mice were placed into a custom-made transparent plastic cylin- 
der (16cm diameter x 16cm height) containing home cage bedding in order to 
reduce the arena parameters and facilitate sleep. At 07:00, mice were placed in the 
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behavioural sequencers and given 1h to acclimate. After the acclimation period, 
behaviour was automatically scored for 6h (08:00-14:00). Data was manually 
curated to identify bouts of sleep, which were defined as at least 40s of immobility”” 
that is not interrupted by more than 2s of upper body movements (sequencer labels 
‘head? ‘face’ ‘nose’ ‘cheek; ‘back; ‘orient_look; ‘orient sniff’) or by 4 or more sec- 
onds of general body movements (sequencer labels ‘paw, ‘leg; ‘tummy; etc.). These 
bout parameters were confirmed by two observers viewing raw video during pilot 
tests. Behavioural recordings and analysis were conducted by a genotype-blind 
experimenter. Given the 40s minimum required to be classified as a bout, median 
data and cumulative probability were used to compare genotypes. Kolomgorov— 
Smirnov and the Wilcoxon rank-sum tests were used for statistical analysis. All 
individual data points are plotted with crossbar denoting median. 
Fibre-photometry-based optical chloride measurements. Mice (n=6 WT and 
6 KO) were injected with 400 nl of AAV-hSyn-SuperClomeleon into the LGN (from 
Bregma A-P, —-2mm; M-L, —2.1 mm; D-V, —2.5mm) and chronically implanted 
with optical fibres (400 |1m, Doric lenses) targeted directly above LGN (D-V, 
2.1mm). Following at least two weeks of virus expression, FRET-based measure- 
ment of visual-evoked [Cl]; responses was performed as previously described)’. 
Briefly, CFP excitation was achieved through a fibre-coupled LED (Thorlabs) 
light source, filtered using a 434nm clean-up filter (MF434-17 Thorlabs) and 
light was delivered to the LGN via a 600 1m, 0.48 NA optic patch cord (Fig. 3a) 
SuperClomeleon CFP and YFP emissions were separated using a single-edge 
beam splitter (FF511-Di01, Semrock) and collected using a two femtowatt silicon 
photoreceiver (Newport). Signal was digitized and recorded using a TDT signal 
acquisition system (Tucker-Davis Technologies). Visual stimulation consisted of 
a50ms LED light pulses delivered to the eye contralateral to the recorded LGN. 
Normalized delta fluorescence (AF/F) was calculated for evoked responses relative 
to the baseline fluorescence level before each event (1-s window) and smoothed 
with a convolution filter (50 ms half-width). The minimum signal within a 500 ms 
window following stimulation was considered the peak response. For pharma- 
cological SK channel enhancement, mice were injected with 25 mgkg ! 1-EBIO 
following 30 min of baseline recordings. Maximal drug effects were estimated from 
evoked responses recorded between 30 min to 1h following injection. 

Visual detection task. Mice (C57, n=8 WT and 9 KO from 2 independently tested 
cohorts; Sst-Cre, n= 8 WT and 8 KO from 2 independently-tested cohorts; 1-EBIO, 
7 WT-Veh, n=7 WT 1-EBIO, 7 KO veh., 7 KO 1-EBIO) were food restricted to 
85-90% of their ad libitum body weight and training occurred in a custom-built 
test chamber as previously described**. During testing, a white noise indicated 
that a new trial was available, and mice had to continuously break an infrared 
barrier for 500-700 ms to initiate a trial. Upon successful initiation, a 50 ms 
visual stimulus was presented randomly either on the left or right side. Correct 
response at the corresponding nose-poke unit resulted in a milk reward (10 il 
evaporated milk, Nestle) that was available for 15s. Following an inter-trial interval 
of 5s, a new trial became available. Response at the incorrect location resulted in 
immediate blockage of poke access and a 30s timeout before the next trial. To 
test for distractibility, in one third of the trials a 50 ms distractor appeared at the 
opposite location of where the stimulus would be displayed in the 200 ms time 
window before successful initiation. For drug treatment experiments, mice were 
injected with 1-EBIO (Tocris #1041; 25mg kg! in 10% DMSO; subcutaneous 
injection) or vehicle 30 min before testing. Wilcoxon rank-sum test was used for 
statistical analysis. All individual data points are plotted with crossbar denoting 
median. 

Fura-2AM calcium imaging. Freshly prepared brain slices from P21-P28 male 
mice (n=4 WT and 5 KO) were superfused with oxygenated ACSF (125 nM NaCl, 
25nM NaHCOs;, 25 nM glucose, 2.5nM KCl, 1.25nM NAH>POy,, 1.2nM MgCh, 
2nM CaCl, 1.7nM ascorbic acid) and recorded at 30-34 °C. Cells (n=37 WT, 
36 KO) were filled with Fura-2AM dye (Molecular Probes) using a Picospritzer II 
that applied ~10 pounds per square inch (psi) for 1 min through a 1-2 MQ? glass 
pipette. Slices were given 1h to recover in ACSF before imaging. Individual cells 
were imaged using an Olympus BX61WI microscope with an attached CoolSnapF2 
camera. Pixel intensity was measured using Image] software with whole-field 
background correction. Ratiometric values were converted to [Ca”*]; using the 
following equation 


[Ca?*]; = Ka(R a Rmmin)/(Rmax = R)Sp/Sp2 


where Kj refers to the Ca** dissociation constant (140nM), R refers to the ratiom- 
etric measurement of the observed cell, Rinax and Rmin correspond to the ratio under 
conditions of saturated Ca’* levels and in zero Ca**, respectively. The values of Si2 
(bound state) and Sp (free Ca** state) are proportional to the fluorescence excited 
by 380 nm under conditions of saturated Ca”* levels and in zero Ca*", respectively. 
Unpaired t-tests were used to compare nanomolar concentrations between the two 
groups. Data presented as mean +s.e.m. 
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Open field. Locomotor activity in mice (C57, n=30 WT and 31 KO from 3 inde- 
pendently tested cohorts; mixture of C57 and 129 mice (referred to hereafter as 
mix), 10 WT and 11 KO from 1 cohort; amphetamine-treated, 5 WT veh., 5 WT 
amph., 6 KO veh., 6 KO amph.; 1-EBIO, 13 WT veh., 13 WT 1-EBIO, 13 KO veh., 
and 15 KO 1-EBIO from 2 independently-tested cohorts; Sst-Cre, 21 WT and 22 
KO from 2 independently tested cohorts) was evaluated over a 60 min period in 
an automated Omnitech Digiscan apparatus (AccuScan Instruments). Locomotor 
activity was assessed as total distance travelled (m). Anxiety-like behaviour was 
defined by number of rearings and time spent in the centre as compared to time 
spent in the perimeter (thigmotaxis) of the open field. For drug treatment exper- 
iments, mice were placed in the open field arena for 30 min before amphetamine 
(3mgkg! in saline; intraperitoneal injection), 1-EBIO (25mgkg ! in 10% DMSO; 
subcutaneous injection), or vehicle injections. Mice were then returned to open 
field arena for an additional 30-90 min. Two-way repeated measures ANOVA 
with Bonferroni post-hoc tests were used for statistical analysis. Time-binned data 
(‘distance’) presented as mean +s.e.m. Summated data (‘total distance’) plotted as 
individual data points with crossbar denoting mean. One C57 knockout mouse 
was excluded for escaping from the arena during testing. 

Grooming. Young adult male mice (C57, n =9 WT and 13 KO from 1 cohort; 
mix, n= 10 WT and 12 KO from 1 cohort) were used for analysis of grooming 
behaviour. Individually housed animals were habituated in the testing room for one 
hour before experimentation. Mice were video-taped for 2h under 2 Ix (red light) 
illumination. Grooming behaviours were coded from 19:00-21:00 (2h beginning at 
the initiation of the dark cycle). This segment was analysed using Noldus Observer 
software and the total amount of time in the 2h segment spent grooming was 
determined. A genotype-blind observer recorded all types of grooming, includ- 
ing incidences of face-wiping, scratching/rubbing of head and ears, and full-body 
grooming. Two-tailed t-tests were used for statistical analysis. All individual data 
points are plotted with crossbar denoting mean. 

Rotarod. Motor coordination was assessed in mice (C57, n=19 WT and 21 KO 
from 2 independently tested cohorts; mix, n= 10 WT and 10 KO from 1 cohort) 
using an accelerating rotarod test (Med Associates) over the course of two days. On 
the first day (training day) animals underwent three 5 min trials at a constant speed 
(16r.p.m.). On the second day (testing day), animals underwent three 5 min trials 
at accelerating speeds (4-40 r.p.m.). For all trials, the latency to fall was determined. 
Animals were tested for three trials in a single day with an inter-trial interval of 
10-30 min. Two-way repeated measures ANOVA with Bonferroni post-hoc tests 
were used for statistical analysis. Data plotted as mean +s.e.m. Two C57 knockout 
mice were excluded for jumping off the rotarod during testing. 

Hanging wire. Mice (C57, n= 12 WT and 11 KO from 1 cohort; mix, 10 WT and 
12 KO from 1 cohort; 1-EBIO, 6 WT veh., 6 WT 1-EBIO, 6 KO veh., and 6 KO 
1-EBIO from 1 cohort; Sst-Cre, 12 WT and 11 KO from 1 cohort) were suspended 
40 cm above the ground from a 2mm horizontal wire. The average of three trials 
with an inter-trial interval of 5 min was recorded. For drug treatment experi- 
ments, mice were injected with 1-EBIO (Tocris #1041; 25 mgkg! in 10% DMSO; 
subcutaneous injection) or vehicle 30 min before testing. Two-tailed t-tests were 
used for statistical analysis. All individual data points are plotted with crossbar 
denoting mean. 

Gait. The forepaws of the mice (C57, n=10 WT and 11 KO from 1 cohort) were 
painted green and the hindpaws were painted pink. After a two minute habituation 
trial, the mice were allowed to walk down a 50cm track. The length and width of 
each stride were measured by an observer blind to genotype and the averages were 
recorded. Two-tailed t-tests were used for statistical analysis. All individual data 
points are plotted with crossbar denoting mean. 

Hot plate. Mice (C57, n=20 WT and 21 KO from 2 independently tested cohorts; 
mix, 10 WT and 12 KO from 1 cohort) were placed onto a heating block set to 55°C 
surface temperature (Columbus Instruments). Latency to lick a forepaw or hind- 
paw was measured. The average of three trials with an inter-trial interval of 5 min 
was recorded. Two-tailed t-tests were used for statistical analysis. All individual 
data points are plotted with crossbar denoting mean. 

Acoustic startle and pre-pulse inhibition. Auditory abilities and sensory motor 
function in mice (C57, n=20 WT and 21 KO from 2 independently-tested cohorts; 
mix, 10 WT and 12 KO from 1 cohort) was measured using Hamilton Kinder 
Scientific Pre-pulse Startle Monitor chambers with Startle Monitor software. On 
the first day, mice underwent a 5 min acclimation trial in the acoustic startle boxes. 
On the second day, half of the mice were tested on the acoustic startle protocol 
while the other half was tested on the pre-pulse inhibition protocol. These protocol 
groups were switched on the third day. To test acoustic startle, mice are presented 
with pulses of various decibel levels without pre-pulses for approximately 30 min. 
The testing session is preceded by a 5 min exposure to 65 dB background noise. 
Each mouse then receives a total of 92 stimuli (trials) with inter-trial intervals 
ranging from 7-23 s presented in pseudo-random order. The stimuli include a pres- 
entation of eight pulse-alone trials (120 dB, 40 ms pulse, four at the beginning and 
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four at the end of the session), 77 pulse trials (seven each of 70, 75, 80, 85, 90, 95, 
100, 105, 110, 115, and 120 dB, 40 ms pulse), and seven trials each without pulse or 
pre-pulse presentation. To test pre-pulse inhibition, mice were once again exposed 
to 65 dB background for 5 min before testing. Each mouse received a total of 57 
stimuli (trials) with inter-trial intervals ranging from 7-23 s presented in pseudor- 
andom order. The stimuli include a presentation of eight pulse-alone trials (120 dB, 
40 ms pulse, four at the beginning and four at the end of the session), 35 pre-pulse 
trials (seven each of 70, 75, 80, 85 and 90 dB, 20 ms pre-pulse given 100 ms before 
a 120dB, 40 ms pulse), and seven trials each without pulse or pre-pulse presenta- 
tion. The pre-pulse inhibition percentage within each test session was calculated 
as follows: (100 — (mean pre-pulse response/mean pulse response) x 100)). For 
all experiments, response to startle stimuli is measured in newtons (N). Startle at 
each pulse level is averaged across trials and then across animals in a treatment 
group. Two-way repeated measures ANOVA with Bonferroni post-hoc tests were 
used for statistical analysis. Data plotted as mean + s.e.m. 

Three-chambered social interaction assay. Littermate male mice (C57, n= 10 
WT and 11 KO from 1 cohort; mix, 10 WT and 12 KO from 1 cohort) were used for 
all tests. Age and size-matched 129 male target subjects (‘stranger 1’ and ‘stranger 
2’) were habituated to being placed inside wire cages for three days (20 min per 
day) before beginning of testing. Test mice were habituated to the testing room 
for at least 1h before the start of behavioural tasks. The social test apparatus con- 
sisted of a transparent acrylic box with removable floor and partitions dividing the 
box into three chambers. The wire cages used to contain the stranger mice were 
cylindrical, 11 cm in height, a bottom diameter of 10.5 cm with the bars spaced 
1cm apart (Galaxy Cup, Spectrum Diversified Designs). An inverted metal can 
was placed on the top of the cage to prevent the test mice from climbing on the 
top of the wire cage. For the sociability test, the test animal was introduced to the 
middle chamber and left to habituate for 10 min. Following this period, the middle 
chamber doors were opened and the test mouse was allowed to freely explore all 
three chambers for an additional 10 min. The test mouse was then returned to the 
middle chamber, after which an unfamiliar mouse (stranger 1) was introduced 
into a wire cage on one of the side-chambers and an empty wire cage (E) on the 
other side-chamber. The dividers were then raised and the test animal was allowed 
to freely explore all three chambers over a 10 min session. Following the 10 min 
session, the mouse was returned to the middle chamber while a novel stranger 
mouse (stranger 2) was inserted in the wire cage, previously empty, and again the 
test animal was left to explore for a 10 min session. The time spent by the mouse 
(nose-point) in close proximity (~5 cm) to the wire cages was calculated using 
automated software (Noldus Ethovison 9). The release of the animals and relative 
position of social and inanimate targets was counterbalanced. However, for each 
individual test animal the location of stranger 1 was maintained during stranger- 
1-E and stranger-1-stranger-2 testing of the social behaviour. One-way ANOVA 
with Bonferroni multiple comparison tests were used for statistical analysis. All 
individual data points are plotted with crossbar denoting mean. 

Resident intruder. Mice (C57, 1 =10 WT and 10 KO from 1 cohort; 1-EBIO, 6 
WT veh., 6 WT 1-EBIO, 6 KO veh., 6 KO 1-EBIO from 1 cohort; Sst-Cre, 6 WT 
and 6 KO from 1 cohort) were individually housed for 2 weeks before testing with 
bedding left undisturbed for one week before testing. Animals were tested in a 
room with 101x lighting. On the test day, animals were habituated to the test room 
for 1h before testing. After acclimation, an age- and weight-matched conspecific 
male stranger mouse was introduced to the home cage of the test animal. The 
subsequent interactions were videotaped for 5 min. All interactions were analysed 
using Noldus Observer by a genotype-blind observer. Aggressive interactions were 
defined by instances of biting, fighting, and tail-rattling. For drug treatment exper- 
iments, mice were injected with 1-EBIO (Tocris #1041; 25 mgkg! in 10% DMSO; 
subcutaneous injection) or vehicle 30 min before testing. Wilcoxon rank-sum and 


Kruskal-Wallis non-parametric tests were used for statistical analysis because 
wild-type mice did not pass normality tests. All individual data points are plotted 
with crossbar denoting median. 

Inhibitory avoidance. Mice (C57, n= 24 WT and 23 KO from 2 independent- 
ly-tested cohorts; mix, 10 WT and 12 KO from 1 cohort, 1-EBIO, 7 WT veh., 8 
WT 1-EBIO, 9 KO veh., and 10 KO 1-EBIO from 2 independently-tested cohorts; 
Sst-Cre, 11 WT and 12 KO from 1 cohort) underwent one training session followed 
by 24h and 48h post-training trials. On the training day, mice were placed in the 
light side of the inhibitory avoidance box (Ugo Basile) and allowed to habituate for 
30s. To quantify pre-shock latency and the effects of baseline locomotor activity 
levels on this task, the door to the dark side of the box was then opened and the 
latency to cross was measured. Upon entering the dark side of the box, the mice 
were given a 0.5mA shock for 2-4, followed by a 60s post-shock habituation. 
On the probe trials, mice were placed in the light side of the box with the door to 
the dark side already opened. Latency to cross was once again measured with a 
maximum duration of 9 min. For drug treatment experiments, mice were injected 
with 1-EBIO (Tocris #1041; 25 mgkg ! in 10% DMSO; subcutaneous injection) 
or vehicle 30 min before the shock training protocol. Two-way repeated measures 
ANOVA with Bonferroni post-hoc tests were used for statistical analysis. All indi- 
vidual data points are plotted with crossbar denoting mean. 

Fear conditioning. Mice (C57, n= 10 WT and 11 KO from 1 cohort) were 
tested for fear-induced freezing using Med Associates fear-conditioning cham- 
bers encased in sound attenuating cubicles optimized for near infrared (NIR) 
video recording. The training protocol involved a 3 min habituation period in 
the conditioning box, followed by three rounds of a 30s tone +2 of 0.75mA 
shock +90s rest. The final shock was followed by a 2 min post-training habit- 
uation. The following day, mice were returned to the conditioning box and the 
time spent freezing was measured. Four hours after, the mice were returned to 
the conditioning boxes with modifications. A white triangular insert was added 
to change the dimensions of the box and 0.1% acetic acid was sprayed onto the 
base of the box in order to change the scent of the box. Once again, the time spent 
freezing over the course of 5 min was measured. Freezing was measured using 
Video Freeze software package analysis of NIR recordings. Two-tailed t-tests were 
used for statistical analysis. All individual data points are plotted with crossbar 
denoting mean. 

Morris water maze. Spatial learning testing in mice (C57, n= 10 WT and 10 KO 
from 1 cohort) was conducted using a testing pool that was 120 cm in diameter 
and a platform 8 cm in diameter. The platform was submerged 1 cm below the 
water surface. Pool water was maintained at 23.0 + 0.5°C and made opaque by 
mixing in non-toxic white paint. During training, 90s duration trials were used. 
If the animals did not find the platform within 90s, the experimenter guided 
the animal to the platform. After reaching the platform, the animals were left 
for 15s on top of the platform before being removed. Trials were administered 
for 5-6 days with four trials per animal per day with the platform located in the 
north-west (NW) quadrant. For two consecutive days after the training protocol, 
60s probe trials were performed (one per day). The reversal training commenced 
with the platform in the south-east quadrant, and proceeded as described above. 
The experimenter followed the animals’ progress using tracking software out- 
side of the testing room. Tracking and analysis were performed using the Noldus 
Ethovison software. Two-way repeated measures ANOVA with Bonferroni post- 
hoc tests and one-way ANOVA with Bonferroni multiple comparison tests were 
used for statistical analysis. All individual data points are plotted with crossbar 
denoting mean. 


50. Fisher, S. P. et al. Rapid assessment of sleep-wake behavior in mice. J. Biol. 
Rhythms 27, 48-58 (2012). 
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Extended Data Figure 1 | Developmental expression pattern of Ptchd1. a-d, In situ hybridization labelling of Ptchd1 mRNA at PO (coronal) (a), P15 
(coronal) (b), and P35 (coronal and sagittal) (c-d) from 3 C57/B16 wild-type mice per age. White arrows indicate location of TRN region (scale bars, 1 mm). 
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Extended Data Figure 2 | Generation of Ptchd1-knockout mouse. transmembrane domains normally present in the endogenous full-length 
a, Schematic describing strategy to create Ptchd1-knockout mouse. Mice isoform. c, In situ hybridization probes targeting exon 2 confirm successful 
containing targeted allele were crossed to b-actin Flp mice to remove genetic ablation of full-length Ptchd1 mRNA (scale bar, 1mm). d, PCR 
the Neo cassette and 3-actin Cre mice to excise exon 2. b, Diagram genotyping confirms deletion of exon 2 from genome of male knockout 
depicting the ‘full-length’ and non-functional ‘exons 1+3’ Ptchd1 isoforms. mice. e, qPCR of wild-type and knockout cDNA samples shows removal of 
Genetic ablation of Exon 2 results in the removal of a majority of the full-length Ptchd1 isoform. Het, heterozygous. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Burst and spindle phase-locking 
characteristics of Ptchd1-knockout and wild-type TRN neurons 

in vivo. a, Left, example of unit clustering for a stereotrode recording. 

Two units (green and blue) are clearly separated when plotting peak- 
trough of the two electrodes of the streotrode against each other. Right, 
spike-wave form of the two clustered units as they appear on the two 
electrodes of the stereotrode. Raw trace below shows a burst discharge 
(asterisk) of each unit during non-rapid eye movement sleep with coloured 
ticks indicating corresponding individual spikes. A burst was identified 

as at least 2 spikes with an inter-spike interval of <10 ms, preceded by a 
period of 70 ms silence. Enlarged trace shows the accelerando-decelerando 
firing pattern characteristic for a TRN burst. b, Firing rate during non- 
rapid eye movement sleep is comparable between genotypes (n= 89 WT, 


80 KO cells from 4 WT, 3 KO mice; P > 0.1, Kolmogorov-Smirnov test). 
c, Ptchd1-knockout TRN neurons show reduced propensity to generate 
bursts, even when excluding the 10% of knockout cells with the highest 
firing rate (n = 89 WT, 72 KO cells from 4 WT, 3 KO mice; P< 0.05, 
Kolmogorov-Smirnov test). d, Spindle-phase histogram for an example 
wild-type and knockout neuron. Note, the wild-type neuron shows a 
preferred phase around the peak (0 degrees) of the spindle oscillation in 
wild type but not knockout. e, Example local field potential recording 
(LEP; top) showing the temporal alignment of TRN spikes (bottom) to 
the preferred phase of the spindle activity (9-15 Hz, middle). f, Ptchd1- 
knockout mice show reduced phase-locking strength to spindle activity 
compared to wild-type littermates (n = 89 WT, 80 KO cells from 

4 WT, 3 KO mice). 
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Extended Data Figure 4 | Ptchd1-knockout mice have intact sensory 
responses and rotarod performance. a-c, Normal acoustic startle 

(a), pre-pulse inhibition (PPI; b) and hot plate response (c) in Ptchd1- 
knockout mice (n =20 WT (a-c), 20 KO (b), 21 KO (c)). d, Ptchd1- 
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knockout mice show normal motor coordination on the accelerating 
rotarod test (n= 19 WT, 20 KO). Two-tailed t-tests (c) and two-way 
repeated measures ANOVA with Bonferroni post-hoc tests (a—b, d) were 
used for statistical analysis. Error bars, mean + s.e.m. NS, not significant. 
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Extended Data Figure 5 | Intact spatial learning but motor and 
aggression abnormalities in Ptchd1-knockout mice. a, Comparable 
learning curves between wild-type and knockout mice during cued 
training protocol. b, Intact spatial learning demonstrated in 24h probe 
trial. c, Ptchd1-knockout mice show normal reversal learning curve. d, No 
significant difference between wild-type and knockout mice in 24h probe 
trial after reversal learning protocol (n= 10 WT, 10 KO). e, Representative 
images of wild-type (black) and knockout (red) strides. Forepaw position 
is represented by green paint and hindpaw position is represented by pink 
paint (scale bar, 2 cm). Quantification reveals elongated stride length and 
width (1=10 WT, 11 KO). f, Knockout mice show marked reductions in 
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grip strength as measured by the hanging wire test (n = 12 WT, 11 KO). 

g, h, Knockout mice attack intruder mice for a longer duration (g) 

and with a shorter latency to attack (h) in the resident-intruder test 

for aggression (n= 10 WT, 10 KO). Two-way repeated measures 

ANOVA with Bonferroni post-hoc tests (a, c), one-way ANOVA with 
Bonferroni multiple comparison tests (b, d), two-tailed t-tests (e, f) and 
Wilcoxon rank-sum tests (g, h) were used for statistical analysis. Chance 
performance (25%) represented by dashed grey lines (b, d). Error bars, 
mean + s.e.m (a, e); mean (b, d, e-f), median (g, h). *P < 0.05; **P < 0.01; 
***EP < 0.001. 
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Extended Data Figure 6 | Hyperactivity, hypotonia, and learning 
deficits in C57/129 Ptchd1-knockout mice. a, Ptchd1-knockout mice 
showed increased locomotor activity (n = 10 WT, 11 KO). b, c, Knockout 
mice show decreased mean holding time in the hanging wire test (b) 

(n= 10 WT, 12 KO), but normal motor coordination in the rotarod task 
(c) (n= 10 WT, 10 KO). d-f, Sensory responses as measured by acoustic 
startle (d), pre-pulse inhibition (e), and hot plate (f) are also normal in 
knockout mice (n= 10 WT, 12 KO). g-h, Normal sociability (g) and novel 


social recognition (h) in mixed background Ptchd1-knockout mice (n= 10 
WT, 11 KO). i, Knockout mice show impaired associative learning and 
memory in the inhibitory avoidance task (n = 9 WT, 12 KO). Two-tailed 
t-tests (b, f), one-way ANOVA with Bonferroni multiple comparison tests 
(g, h), and two-way repeated measures ANOVA with Bonferroni post-hoc 
tests (a, c-e, i) were used for statistical analysis. Error bars, mean + s.e.m.; 
horizontal bars, mean (b, f-i). *P < 0.05; **P < 0.01; ***P< 0.001. 
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behaviours in Ptchd1-knockout mice. a, Knockout mice do not show tailed t-tests (a) and two-way repeated measures ANOVA with Bonferroni 
excessive or injurious grooming behaviours (n= 9 WT, 13 KO). post-hoc tests (b, c) were used for statistical analysis. Horizontal bars, 
b, c, Knockout mice spent comparable amounts of time interacting with mean. ***P< 0.001. 
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Extended Data Figure 8 | YFP overlap with somatostatin interneuron (BGH) cassette. b, YFP* cells co-label with anti-GAD67 antibody in TRN 
marker is primarily confined to the TRN in Ptchd1-YFP mice. and the Purkinje layer of the cerebellum, but not in cortex or striatum. 
a, Schematic describing strategy to create Ptchd1- YFP mouse in which c, YFP* cells also co-label with anti-somatostatin antibody in TRN, but 
exon 1 was replaced with a YFP-bovine growth hormone poly-A tail not in other structures. Arrows denote overlap. Scale bars, 201m. 
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Extended Data Figure 9 | Sst-Cre recombinase activity is early and shows magnified image taken with 20x objective. b, c, At P15 (b) and 
robust in TRN neurons. a, The progeny of Sst-Cre* mice crossed to mice P30 (c), Cre recombinase activity in the TRN of the Sst-Cret TdTomato* 
showing Cre-dependent expression of the TdTomato fluorescent protein mice brains is robust, as shown by the inset depicting the significant 
(Sst-Cret TdTomato*) show TdTomato* cells in the TRN at P4. Inset TdTomato overlap with the pan-neuronal marker NeuN. 
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Extended Data Figure 10 | Genetic disruption of Ptchd1 TRN 
expression affects sleep stability but not grip strength or aggressive 
behaviours. a, b, Sst-Cre* Ptchd1*" mice appear normal in the hanging 
wire (a) (n= 12 Ptchd1*!*, 11 Ptchd1*") and resident intruder task 

(b) (n=6 Ptchd1*'*, 6 Ptchd1™""), c-e, Sst-Cre* Ptchd1* mice show 
reductions in sleep bout duration as shown in cumulative probability 
plot and comparison of medians (d) with no differences in total time 
spent sleeping when compared to Sst-Cre*:Ptchd1*'* littermates (e) 
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(n=10 Ptchd1‘'*, 10 Ptchd1*""), £, g, 1-EBIO treatment has no effect on 
performance on the hanging wire (f) or resident intruder task (g) (n=6 
WT veh., 6 WT 1-EBIO, 6 KO veh., 6 KO 1-EBIO). Kolomgorov—Smirnov 
test (a), Wilcoxon rank-sum tests (b, c), two-tailed t-tests (d), and two- 
way repeated measures ANOVA with Bonferroni post-hoc tests (f), 

and Kruskal-Wallis with Dunn’s multiple comparisons tests were used 

for statistical analysis. Horizontal bars, median (b, c, e-g), mean (d, f). 
*P< 0.05; **P< 0.01; ***P< 0.001. 
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Cytolytic proteins and peptide toxins are classical virulence factors of several bacterial pathogens which disrupt epithelial 
barrier function, damage cells and activate or modulate host immune responses. Such toxins have not been identified 
previously in human pathogenic fungi. Here we identify the first, to our knowledge, fungal cytolytic peptide toxin in the 
opportunistic pathogen Candida albicans. This secreted toxin directly damages epithelial membranes, triggers a danger 
response signalling pathway and activates epithelial immunity. Membrane permeabilization is enhanced by a positive 
charge at the carboxy terminus of the peptide, which triggers an inward current concomitant with calcium influx. 
C. albicans strains lacking this toxin do not activate or damage epithelial cells and are avirulent in animal models of 
mucosal infection. We propose the name ‘Candidalysin’ for this cytolytic peptide toxin; a newly identified, critical 
molecular determinant of epithelial damage and host recognition of the clinically important fungus, C. albicans. 


The ability of mucosal surfaces to discriminate between commensal and 
pathogenic microbes is essential to human health. The fungus Candida 
albicans is normally a benign member of the human microbiota, but is 
also responsible for millions of mucosal infections each year in 
immunocompromised hosts, often with severe morbidity!. A defin- 
ing feature of C. albicans pathogenesis is the transition from yeast to 
invasive filamentous hyphae’. Hyphae damage mucosal epithelia and 
induce activation of the activating protein-1 (AP-1) transcription factor, 
c-Fos (via p38-MAPK) and the MAPK phosphatase MKP1 
(via ERK1/2-MAPK), which trigger pro-inflammatory cytokine 
responses*’. These signalling events constitute a ‘danger response’ 
against invasive hyphae, thus serving as a sensor of pathogenic C. albicans 
invasion®-'4, However, it is unclear how C. albicans hyphae induce 
epithelial inflammatory responses and cell damage during mucosal 
infections. Here we identify and characterize Candidalysin, the first, 
to our knowledge, cytolytic peptide toxin isolated from any human 
fungal pathogen, as the hyphal factor critical for epithelial immune 
activation and C. albicans mucosal infection. 


Ecelp is vital for epithelial activation and damage 

Despite the well-known association between filamentation and viru- 
lence, the molecular mechanism underlying hypha-driven epithelial 
activation and mucosal damage has remained obscure. To elucidate this 
mechanism, we screened a panel of C. albicans gene deletion mutants 
that targeted key processes, pathways and proteins known or predicted 
to be associated with the yeast—hyphal transition and pathogenicity (62 
strains). Only hypha-producing strains induced MKP1 phosphorylation 


(p-MKP1), c-Fos, cytokines (IL-la, IL-6, G-CSF) and damage in oral 
epithelial cells (Extended Data Table 1). However, one C. albicans 
mutant (ecel A/A)!* formed normal hyphae but was incapable of 
inducing these epithelial danger responses. The C. albicans extent of 
cell elongation 1 gene (ECE1) is highly expressed by hyphae during epi- 
thelial infection (Extended Data Fig. 1a, b), and is predicted to encode 
a secreted protein’®. To probe its function, we generated a panel of 
C. albicans ECE1 mutants (Extended Data Table 2). The ece1A/A strain 
formed normal hyphae on (Extended Data Fig. 1c), and adhered to 
and invaded human epithelial cells similarly to wild-type C. albicans 
(Extended Data Fig. 1d, e). The strain ece] A/A was capable of exten- 
sive epithelial invasion, penetrating through multiple epithelial cells 
(Extended Data Fig. 1f). Despite this, invasive ece1 A/A hyphae did not 
damage epithelia or induce p-MKP1/c-Fos-mediated danger responses 
or cytokine secretion (Fig. la—d). Thus, Ece1p is critical for epithelial 
damage and innate recognition of C. albicans hyphae in vitro. 


Ecelp is critical for mucosal pathogenesis 

We next assessed the role of ECE] in two in vivo models of C. albicans 
mucosal infection. In murine oropharyngeal candidiasis (OPC)"’, 
mice infected with C. albicans wild-type or ECE1 re-integrant 
(ece1A/A + ECE1) strains exhibited disease symptoms, including 
extensive hyphal invasion of the tongue epithelium, micro-abscesses 
of infiltrating neutrophils and tissue damage (Fig. le, f, h, i). In contrast, 
tongue tissue from ece1 A/A-infected animals (n = 17/20) showed no 
invasive fungi and no inflammatory infiltrates or damage (Fig. 1g). 
We detected very low numbers of ece1A/A cells in only 3/20 mice 
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Institute, D-O7745 Jena, Germany. 8Centre for Ultrastructural Imaging, King’s College London, London SE1 1UL, UK. 2Department of Life Sciences, Imperial College London, London SW7 2AZ, 
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Figure 1 | ECE] is required for epithelial activation and C. albicans 
infection. TR146 cells were infected with the indicated C. albicans strains. 
a, LDH release 24h post-infection (p.i.) (MOI = 0.1). b, Induction of 
p-MKP-1 and c-Fos at 2h p.i. (MOI = 10). ¢, c-Fos DNA binding at 3h 

p.i. (MOI= 10). d, G-CSF production at 24h p.i. (MOI= 0.01). e-i, PAS- 
stained tongues from mice subjected to OPC 2 days p.i. Whole-mount 
(x25) (e, g, h) and high-magnification (f, i) (x 200) views of PAS-stained 
tongues of mice infected with C. albicans wild type (e, f), ece1A/A (g) 

and ece1A/A + ECE1 (h, i). Invading hyphae (black arrowhead) and 
inflammatory cells (blue arrowhead) are indicated. j, Quantification of 
neutrophils in zebrafish swimbladder following infection with wild-type 
C. albicans (number of fish (n) = 47), ece1A/A (n= 53) or PBS (n= 40). 
k, Quantification of damaged cells in zebrafish swimbladder after infection 
with C. albicans wild-type (n =73) or ece1A/A strains (n = 59) or vehicle 
(n= 63). Data are representative (b, e-i) or the mean (a, ¢, d, j, k) of three 
biological replicates. Error bars + s.e.m. Data were analysed by one-way 
ANOVA (a, d), paired t-test (c) or Kruskal-Wallis test (j, k). *P < 0.05, 
**P < 0.01, ***P < 0.001. For gel image, see Supplementary Fig. 1. 


(Extended Data Fig. 2a), which showed no evidence of local epithelial 
damage (data not shown). Quantification of histology sections indi- 
cated that the percentage of epithelial surface infected was significantly 
greater with the wild-type and ECE] re-integrant strains (Extended 
Data Fig. 2b). In a zebrafish swimbladder model of mucosal infec- 
tion'®!9, neutrophil recruitment and tissue damage were both sig- 
nificantly lower following ece1 A/A infection as compared with the 
wild-type strain (Fig. 1j, k and Extended Data Fig. 2c, d). Therefore, 
C. albicans Ece1p is critical for mucosal pathogenesis and is an innate 
immune activator in vivo. 


Ecelp encodes a cytolytic peptide toxin 

Ecelp is an in vitro substrate for Kex2p, a Golgi-located protease that 
cleaves proteins after lysine-arginine (KR) motifs”°. Ecelp contains 
seven KR-processing sites, suggesting it has the potential to produce 
eight secreted peptides from C. albicans*° (Extended Data Fig. 3a, b). 
Liquid chromatography-tandem mass spectrometry (LC-MS/MS) 
analysis confirmed that recombinant Kex2p (rKex2p) processes recom- 
binant Ecelp (rEce1p), and that all eight peptides generated terminated 
in KR (and fragments thereof, showing that less efficient processing 
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occurs also after a single K or R) (Supplementary Information). The 
importance of Kex2p-mediated Ecelp processing was demonstrated 
using a kex2A/A null strain?!, which was unable to damage oral epi- 
thelia or induce p-MKP1/c-Fos-mediated danger responses or cytokine 
secretion (Extended Data Table 1). To determine which Ecelp pep- 
tide(s) were responsible for epithelial activation and damage, oral epi- 
thelial cells were incubated with peptides Ece1-I-VII (1.5-70 1M). 
Only Ece1-TIII¢2-93 induced p-MKP1, c-Fos, cytokines and damage 
(Fig. 2a—c, Extended Data Fig. 3c—e). Notably, low Ecel-III¢2-93 con- 
centrations (1.5-15 1M) were sufficient to induce c-Fos DNA binding 
(Fig. 2d), G-CSF and GM-CSF (Fig. 2c and Extended Data Fig. 3c), 
whereas high Ecel-III¢2-93 concentrations (701M) were required to 
induce damage (Fig. 2e) and the damage-associated cytokines IL-la 
and IL-6, respectively (Extended Data Fig. 3d, e). Ecel-II¢2-93 could 
also directly lyse multiple human epithelial cell types and induce 
haemolysis of red blood cells, a classical test for cytotoxin activity (not 
shown). Neither the N-terminal hydrophobic region (Ecel-II]¢2-s5) nor 
the C-terminal hydrophilic region (Ece1-II]g¢_93) induced p-MKP1, 
c-Fos, cytokines or damage of epithelial cells, either individually or in 
combination (Extended Data Fig. 3f-h), demonstrating that the peptide 
containing both regions is required for activity. Therefore, Ece1-II]¢2-93 
is the active region of Ecelp, acting as an epithelial immune activator 
and a cytolytic agent. 

To confirm that Ecel-[II¢2_93 drives epithelial activation and fun- 
gal pathogenicity, we generated a C. albicans strain lacking only the 
Ecel-IIg2-93 region (ece1A/A + ECE1 qi 4-279). LC-MS/MS analy- 
sis showed that the modified protein in this strain is stable, secreted 
and processed into each of the predicted peptide fragments, with the 
exception of the deleted peptide toxin (Supplementary Information). 
Like ece1A/A, ece1A/A + ECE1 484-279 efficiently formed invasive 
hyphae (not shown). However, ece1 A/A + ECE1 jj 84-279 was unable to 
induce p-MKPI1, c-Fos DNA binding, cytokines, or damage epithelia 
(Extended Data Fig. 3i-l). In murine OPC, unlike the ece1 A/A + ECE1 
complemented strain, ecel A/A + ECE1 jjg4-279-infected mice demon- 
strated absent (n = 4/10) or low (n=6/10) fungal burdens, with no 
evidence of inflammatory infiltrates or local epithelial damage 
(Fig. 2f-h and Extended Data Fig. 4a, b). Likewise, ecel1A/A + 
ECE1 qjg4-279 did not induce full damage in the zebrafish swimblad- 
der model (Fig. 2i and Extended Data Fig. 4c). In contrast, injection 
of lytic doses of Ece1-II¢2-93 into the swimbladder induced epithelial 
damage (Fig. 2j, k). Thus, Ece1-III¢2-93 is both necessary and sufficient 
for epithelial immune activation, damage and mucosal infection in vivo. 

The amphipathic properties of Ece1-III¢2_-93 (SIIGIIMGILGNIPQ 
VIQUMSIVKAFKGNKR) coupled with the a-helical structure of the 
N-terminal hydrophobic region (Extended Data Fig. 5a, b) indicated 
that this fungal peptide may act similarly to cationic antimicrobial 
peptides and peptide toxins such as melittin”” (honey bee), magainin 2 
(ref. 23) (African clawed frog) and alamethicin” (Trichoderma 
viride). Cytolytic peptide toxins have not previously been found in 
human pathogenic fungi, but bacterial cytolytic toxins are known to 
induce lesions after binding to target cell membranes”*”*. To investigate 
the importance of lipid composition for Ece1-III¢._93-mediated cytol- 
ysis, we used Forster resonance energy transfer (FRET) and electrical 
impedance spectroscopy to analyse the interactions of Ece1-IL]¢2-93 
with model membranes comprised of lipid bilayers of dioleoylphos- 
phatidylcholine (DOPC) with or without cholesterol. Although Ecel- 
I1]¢2-93 was able to efficiently intercalate into DOPC membranes 
(Extended Data Fig. 5c), Ece1-III¢2-53 permeabilization was enhanced 
in the presence of cholesterol (Fig. 3a). Ece1-III¢2_93-induced lesions 
were heterogeneous and transient (Extended Data Fig. 5d), indicating 
that the peptide may damage target membranes through a ‘carpet-like’ 
mechanism’. Patch-clamp analysis of epithelial cells demonstrated that 
lesion formation by Ece1-I¢2-93 is rapid and causes an inward current 
(Fig. 3b), associated with calcium influx (Fig. 3c). Similar phenomena 
occur with bacterial cytolytic toxins, which are known to trigger cell 
activation?>?8, 
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Figure 2 | Ece1-III¢2_93 is the active region of 
Ecelp and is required for TR146 cell activation 
and mucosal C. albicans infection. a, Induction 
of p-MKP-1 and c-Fos 2h post-stimulation (p.s.) 
with Ecel peptides at 1.5\1M. b, LDH release 

24h p.s. with 70 1M Ecel peptides. c, Induction 
of G-CSF 24h p.s. with Ece1-III¢2_93. d, c-Fos 
DNA binding induction 3h p.s. with sub-lytic 
concentrations of Ece1-III¢2_93. e, LDH release 24h 
p.s. with Ece1-III¢2-93. f-h, PAS-stained tongue 
sections from mice subjected to OPC, 2 days p.i. 
with C. albicans ece1A/A + ECE1 (x25 and x200) 
(f, g) or ece1A/A + ECE1 ji84-79 (h). Invading 
hyphae (black arrowheads) and infiltrating 
inflammatory cells (blue arrowhead) are shown. 
i, Damaged cells in a zebrafish swimbladder 24h 
p.i. with C. albicans ece1A/A + ECE1 (number of 
fish (n) = 44), ecelA/A + ECE1 4 184.279 (n= 58) 
or vehicle (n = 58). j, Damaged cells in zebrafish 
swimbladders after stimulation with 9 ng (n = 51) 
or 1.25 ng (n= 56) Ece1-II]¢2-93, or vehicle (40% 
DMSO, n = 54 and 5% DMSO, n =55). k, Co- 
localization of adherens junctions («-catenin- 
citrine) with Ece1-II¢2-93-damaged cells (Sytox 
Orange-positive cells) in a zebrafish swimbladder. 
Scale bar, 100,1m. Data are representative (a, f-h, k) 
or mean (b-e, i, j) of three biological replicates 
(a-d) or ten mice or fish (f-h, k). Error bars 
show + s.e.m. Data were analysed by one-way 
ANOVA (b, ¢, e) paired t-test (d) or Kruskal- 
Wallis (i, j). *P < 0.05, **P <0.01, ***P <0.001 
(compared with vehicle control unless otherwise 
indicated). For gel image, see Supplementary Fig. 1. 


Figure 3 | Ece1-III¢2-93 functions as a cytolytic 
peptide toxin. a, Kinetic changes in conductance 
of tethered lipid membranes after exposure 

to different concentrations of Ece1-III¢2_93. 

b, Evoked inward current at a membrane potential 
of —60 mV in TR146 cells post-addition of Ece1- 
III¢2-93 or ionomycin (positive control); individual 
(representative) and cumulative changes 

(bar chart indicates number of cells analysed 
below each bar) shown. ¢, Intracellular calcium 
level kinetics in TR146 cells post-stimulation 
(p.s.) with Ecel-III¢2_93 wild type (Ecel -III¢2_-93xr) 
or Ecel-III¢2-93 AA C-terminal substitution 
(Ecel-II¢2-9344). d, Kinetic changes in 
conductance of tethered DOPC membranes 

after exposure to different concentrations of 
Ecel-III¢2-93xr and Ecel-III¢2_9344- e, f, LDH 
release (e) and secretion of IL-la (f) from TR146 
cells 24h p.s. with Ecel-Il]¢2-93xp or Ecel-HI¢2_93,a- 
Data shown are representative (a, d) or mean 

(b, c, e, f) of three biological replicates. Error bars 
show + s.e.m. Data were analysed by one-way 
ANOVA (c, e and f). **P< 0.01, ***P< 0.001. 
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Figure 4 | Ece1-III¢2_92x functions as a cytolytic peptide toxin that 
activates and damages epithelial cells. a-c, Induction of p-MKP-1 and 
c-Fos 2h post-stimulation (p.s.) (a), secretion of G-CSF and IL-1-a 24h 
p.s. (b), and LDH release 24h p.s. of TR146 cells with Ece1-III¢2_92x (¢). 
d, Férster resonance energy transfer (FRET) showing intercalation 

of Ece1-III2_92x (101M) into lipid liposomes. e, Average peptide 
concentration-dependent changes in conductance of tethered lipid 
membranes. f, Ece1-III¢2-92x (441M) induced permeabilization of planar 
lipid membranes showing heterogeneous and transient lesions leading to 
membrane rupture. g, Intracellular calcium level kinetics in TR146 cells 
p.s. with Ece1-III¢2-92x. Data shown are representative (a, d, f) or mean 
(b, c, e, g) of three biological replicates. Error bars show + s.e.m. Data are 
analysed by one-way ANOVA (b, c). *P < 0.05, **P < 0.01, ***P< 0.001 
(compared with vehicle control). For gel image, see Supplementary Fig. 1. 


We postulated that the positively-charged C-terminal KR residues of 
Ece1-III¢2-93 might be critical for interacting with negatively-charged 
components of host membranes to promote lesion formation. 
Substitution of the KR motif to AA (alanine—alanine; Ece1-II¢2-934 a) 
did not affect membrane intercalation (not shown) but significantly 
reduced the peptide’s ability to permeabilize membranes, damage 
epithelial cells and induce calcium influx (Fig. 3c-e). Thus, the pos- 
itive C terminus of Ecel -III¢2_93 is critical for lesion formation and 
damage induction in epithelial membranes. Notably, Ece1-I 162-9344 
still induced p-MKP1, c-Fos and the non-damage associated cytokine 
G-CSF (Extended Data Fig. 5e, f), but not the damage-associated 
cytokine IL-1a (Fig. 3f), suggesting that Ece1-III62-934 can be recog- 
nized by epithelial immunity without damaging cells. The importance 
of this finding is that it means epithelial cells are not only responding to 
damage, but have evolved to specifically recognize the peptide. 


Ecel-III¢2_92x is a secreted cytolytic peptide toxin 

To demonstrate that Ece1-IIJ is generated during epithelial infection, 
we performed LC-MS/MS analysis on the secretome from wild-type 
C. albicans hyphae grown in the presence and absence of epithelial 
cells (Supplementary Information). Notably, Ece1-III was the only 
peptide detected in the presence of epithelial cells, indicating that 
the fungus secretes this toxin during mucosal infection. However, 
the predominant form of secreted Ece1-III terminated in a K residue 


ARTICLE 


(SIIGIMGILGNIPQVIQIIMSIVKAFKGNK; Ecel -II¢2-92x) and not 
KR (SIIGIIMGILGNIPQVIQUMSIVKAFKGNER; Ecel -III¢2_93Kr) 
(Extended Data Table 3). In fungi, it is known that following Kex2p 
processing, many proteins are subsequently cleaved by Kex1p” (also 
in the Golgi), removing the C-terminal R. LC-MS/MS analysis on the 
hyphal secretome of a kex1 A/A mutant demonstrated that the pre- 
dominant peptide secreted terminates in KR (not K) (Supplementary 
Information). Therefore, Ece1p is also subject to ordered Kex2p/Kex1p 
processing. Accordingly, we confirmed that Ece1-II62-92k functioned 
similarly to Ece1-III¢2-93xr with respect to epithelial cell activation. 
Specifically, Ece1-II]¢2-92x is also a-helical (not shown) and induces 
c-Fos, p-MKP1, cytokines (IL-1a, G-CSF), damage (LDH), membrane 
intercalation and permeabilization, and calcium influx (Fig. 4a-g). 
Thus, the dominant peptide secreted from C. albicans hyphae during 
mucosal infection is Ece1-III¢2-92x, which acts as a cytolytic peptide 
toxin that activates epithelial cells. 

Based on these data, we propose a model of C. albicans mucosal 
infection whereby invasive hyphae secrete Ece1-III¢2-92x into a 
membrane-bound ‘invasion pocket’>°"', facilitating peptide accumula- 
tion (Extended Data Fig. 6). During early stages of infection, sub-lytic 
concentrations of Ece1-III¢2-92x induce epithelial immunity by activat- 
ing the ‘danger response’ pathway (p-MKP1/c-Fos), alerting the host 
to the transition from colonizing yeast to invasive, toxin-producing 
hyphae. As infection progresses, Ece1-I1I¢-92x levels accumulate 
and elicit direct tissue damage. Mechanistically, we propose that the 
asymmetric distribution of charge along the a-helix of Ece1-I¢2-92x 
facilitates correct peptide orientation relative to the host membrane, 
enabling intercalation, permeabilization and calcium influx. Our data 
identifies C. albicans Ece1-II1¢2-92x as the first cytolytic peptide toxin 
in a human fungal pathogen and reveals the molecular mechanisms 
of epithelial damage and host recognition of this clinically important 
fungus. We propose the name ‘Candidalysir for this newly discovered 
fungal toxin. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded to allocation 
during experiments and outcome assessment. 

Cell lines, reagents and Candida strains. Experiments were carried out using 
the TR146 buccal epithelial squamous cell carcinoma line’ obtained from the 
European Collection of Authenticated Cell Cultures (ECACC) and grown in 
Dulbecco’s Modified Eagle’s Medium (DMEM, Sigma-Aldrich) supplemented with 
10% fetal bovine serum (FBS) and 1% penicillin-streptomycin. Cells were routinely 
tested for mycoplasma contamination using mycoplasma-specific primers and 
were found to be negative. Prior to stimulation, confluent TR146 cells were serum- 
starved overnight, and all experiments were carried out in serum-free DMEM. 
C. albicans wild-type strains included the autotrophic strain BWP17 + CIp30 
(ref. 33) and the parental strain SC5314 (ref. 34). Other C. albicans strains used 
and their sources are listed in Extended Data Tables 1 and 2. C. albicans cultures 
were grown in YPD medium (1% yeast extract, 2% peptone, 2% dextrose) at 30°C 
overnight. Cultures were washed in sterile PBS and adjusted to the required 
cell density. Antibodies to phospho-MKP1 and c-Fos were from Cell Signalling 
Technologies (New England Biolabs UK), mouse anti-human «-actin was from 
Millipore (UK), and goat anti-mouse and anti-rabbit horseradish peroxidase 
(HRP)-conjugated antibodies were from Jackson Immunologicals (Stratech 
Scientific, UK). Ecelp peptides were synthesized commercially (Proteogenix 
(France) or Peptide Synthetics (UK). 

Generation of C. albicans ECE1 mutant strains. ECE] deletion was performed 
as previously described*». Deletion cassettes were generated by PCR**. Primers 
ECE1-FG and ECE1-RG were used to amplify pFA-HIS1 and pFA-ARG4 -based 
markers. C. albicans BWP17 (ref. 37), was sequentially transformed** with the 
ECE1-HIS1 and ECE1-ARG4 deletion cassettes and then transformed with 
ClIp10 (ref. 39), yielding the ece1A/A deletion strain. For complementation, the 
ECE1 gene plus upstream and downstream intergenic regions were amplified 
with primers ECE1-RecF3k and ECE1-RecR and cloned into plasmid CIp10 at 
Mlul and Sall sites. This plasmid was transformed into the uridine auxotrophic 
ecelA/A strain, yielding the ecel1A/A + ECE1 complemented strain. For gener- 
ation of the ece] A/A + ECE1,j184-279 strain, the CIp10-ECE1 was amplified with 
primers Pep3-F1 and Pep3-R1, digested with Clal and re-ligated, yielding the 
ClIp10+ ECE1 jj84-279 plasmid. This plasmid was transformed into the uridine 
auxotrophic ece1A/A strain, yielding the ece1 A/A + ECE1 a1 g4-279 strain. All 
integrations were confirmed by PCR/sequencing and at least two independent 
isogenic transformants were created to confirm results. KEX1 deletion was per- 
formed exactly as the ECE1 deletion but using primers KEX1-FG and KEX1-RG 
for creating the deletion cassette. Fluorescent strains of ece1 A/A and BWP17 were 
constructed as previously described“°. Briefly, the ece1 A/A and BWP17 strains 
were transformed with the pENO1-dTom-NATr plasmid. Primers used to clone 
and construct the ECE1 genes and intragenic regions are listed in Extended Data 
Table 4. Strains are listed in Extended Data Table 2. 

Construction of C. albicans ECE1 promoter-GFP strain. ECE] promoter (prim- 
ers 5'/ECEIprom-Narl / 3‘ECE1prom-Xhol) and terminator (5'ECEIterm-SaclI / 
5/ECE1term-Sacl) were amplified and cloned into pADH1-GFP. Resulting pSK- 
pECE1-GFP was verified by sequencing. C. albicans SC5314 was transformed with 
the pECE1-GFP transformation cassette**, Resistance to nourseothricin was used 
as selective marker and correct integration of GFP into the ECE] locus was verified 
by PCR. Primers for cloning and validation are listed in Extended Data Table 4. 
Strains are listed in Extended Data Table 2. 

RNA isolation and real-time PCR analysis. C. albicans cells grown on TR146 
epithelial cells were collected into RNA pure (PeqLab), centrifuged and the 
pellet resuspended in 40011 AE buffer (50 mM Na-acetate pH 5.3, 10 mM 
EDTA, 1% SDS). Samples were vortexed (30s), and an equal volume of phenol/ 
chloroform/isoamyl alcohol (25:24:1) was added and incubated for 5 min (65°C) 
before subjected to 2x freeze-thawing. Lysates were clarified by centrifugation 
and the RNA precipitated with isopropyl alcohol/0.3 M sodium acetate by incu- 
bating for 1h at —20°C. Precipitated pellets were washed (2 1 ml 70% ice-cold 
ethanol), resuspended in DEPC-treated water and stored at —80°C. RNA integrity 
and concentration was confirmed using a Bioanalyzer (Agilent). RNA (500 ng) was 
treated with DNase (Epicentre) and cDNA synthesized using Reverse Transcriptase 
Superscript III (Invitrogen). cDNA samples were used for qPCR with EVAgreen 
mix (Bio&Sell). Primers (ACT1-F and ACT1-R for actin, ECE1-F and ECE1-R 
for ECE1 Extended Data Table 4) were used at a final concentration of 500nM. 
qPCR amplifications were performed using a Biorad CFX96 thermocycler. Data 
was evaluated using Bio-Rad CFX Manager 3.1 (Bio-Rad) with ACT as the refer- 
ence gene and ty as the control sample. 

Western blotting. TR146 cells were lysed using a modified RIPA lysis buffer 
(50mM Tris-HCl pH 7.4, 150 mM NaCl, 1mM EDTA, 1% Triton X-100, 1% 
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sodium deoxycholate, 0.1% SDS) containing protease (Sigma-Aldrich) and 
phosphatase (Perbio Science) inhibitors”!, left on ice (30 min) and then clarified 
(10 min) in a refrigerated microfuge. Lysate total protein content was determined 
using the BCA protein quantitation kit (Perbio Science). 201g of total protein was 
separated on 12% SDS-PAGE gels before transfer to nitrocellulose membranes 
(GE Healthcare). After probing with primary (1:1,000) and secondary (1:10,000) 
antibodies, membranes were developed using Immobilon chemiluminescent sub- 
strate (Millipore) and exposed to X-ray film (Fuji film). Human a-actin was used 
as a loading control. 

Transcription factor DNA binding assay. DNA binding activity of transcription 
factors was assessed using the TransAM transcription factor ELISA system (Active 
Motif) as previously described*!”. Serum-starved TR146 epithelial cells were 
treated for 3h before being differentially lysed to recover nuclear proteins using 
a nuclear protein extraction kit (Active Motif) according to the manufacturer's 
protocol. Protein concentration was determined (BCA protein quantitation kit 
(Perbio Science)) and 5 \.g of nuclear extract was assayed in the TransAM system 
according to the manufacturer's protocol. Data was expressed as fold-change in 
A450 nm Felative to resting cells. 

Cytokine determination. Cytokine levels in cell culture supernatants were deter- 
mined using the Performance magnetic Fluorokine MAP cytokine multiplex kit 
(Bio-techne) and a Bioplex 200 machine. The data were analysed using Bioplex 
Manager 6.1 software to determine analyte concentrations. 

Cell damage assay. Following incubation, culture supernatant was collected 
and assayed for lactate dehydrogenase (LDH) activity using the Cytox 96 Non- 
Radioactive Cytotoxicity Assay kit (Promega) according to the manufacturer’s 
instructions. Recombinant porcine LDH (Sigma-Aldrich) was used to generate 
a standard curve. 

Epithelial adhesion assay. Quantification of C. albicans adherence to TR146 epi- 
thelial cells was performed as described previously’’. Briefly, TR146 cells were 
grown to confluence on glass coverslips for 48h in tissue culture plates in DMEM 
medium. C. albicans yeast cells (2 x 105) were added into 1 ml serum-free DMEM, 
incubated for 60 min (37 °C/5% CO ) and non-adherent C. albicans cells removed 
by aspiration. Following washing (3 x 1 ml PBS), cells were fixed with 4% para- 
formaldehyde (Roth) and adherent C. albicans cells stained with Calcofluor White 
and quantified using fluorescence microscopy. The number of adherent cells was 
determined by counting 100 high-magnification fields of 200|1m x 200,1m size. 
Exact total cell numbers were calculated based on the quantified areas and the 
total size of the cover slip. 

Epithelial invasion assay. C. albicans invasion of epithelial cells was determined as 
described previously”. Briefly, TR146 epithelial cells were grown to confluence on 
glass coverslips for 48h and then infected with C. albicans yeast cells (1 10°), for 
3h in a humidified incubator (37 °C/5% COz). Following washing (3x PBS), the 
cells were fixed with 4% paraformaldehyde. All surface adherent fungal cells were 
stained for 1h with a rabbit anti-Candida antibody and subsequently with a goat 
anti-rabbit-Alexa Fluor 488 antibody. After rinsing with PBS, epithelial cells were 
permeabilized (0.1% Triton X-100 in PBS for 15 min) and fungal cells (invading 
and non-invading) were stained with Calcofluor White. Following rinsing with 
water, coverslips were visualized using fluorescence microscopy. The percentage 
of invading C. albicans cells was determined by dividing the number of (partially) 
internalized cells by the total number of adherent cells. At least 100 fungal cells 
were counted on each coverslip. 

Imaging of C. albicans growth and invasion of epithelial cells. TR146 cells 
(10° per ml) seeded on glass coverslips in DMEM/10% FBS were infected with 
C. albicans (2.5 x 104 cfu per ml) in DMEM and incubated for 6h (37 °C/5% COs). 
Cells were washed with PBS, fixed overnight (4°C in 4% paraformaldehyde) and 
stained with Concanavalin A-Alexa Fluor 647 in PBS (101g ml‘) for 45 min at 
room temperature in the dark with gentle shaking (70 r.p.m.) to stain the fungal 
cell wall. Epithelial cells were permeabilized with 0.1% Triton X-100 for 15 min 
at 37°C in the dark, then washed and stained with 10j1g ml! Calcofluor White 
(0.1 M Tris-HCl pH 9.5) for 20 min at room temperature in the dark with gentle 
shaking. Cells were rinsed in water and mounted on slides with 611 of ProLong 
Gold anti-fade reagent, before air drying for 2h in the dark. Fluorescence micros- 
copy was performed on a Zeiss Axio Observer Z1 microscope, and 5 phase images 
were taken per picture. 

Scanning electron microscopy. For scanning electron microscopy (SEM) analysis, 
TR146 cells were grown to confluence on Transwell inserts (Greiner) and serum 
starved overnight in serum-free DMEM. After 5h of C. albicans incubation on 
epithelial cells at an MOI of 0.01, cell media was removed and samples were fixed 
overnight at 4°C with 2.5% (v/v) glutaraldehyde in 0.05 M HEPES buffer (pH 7.2) 
and post-fixed in 1% (w/v) osmium tetroxide for 1h at room temperature. After 
washing, samples were dehydrated through a graded ethanol series before being 
critical point dried (Polaron E3000, Quorum Technologies). Dried samples were 
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mounted using carbon double side sticky discs (TAAB) on aluminium pins (TAAB) 
and gold coated in an Emitech K550X sputter coater (Quorum Technologies Ltd). 
Samples were examined and images recorded using a FEI Quanta 200 field emis- 
sion scanning electron microscope operated at 3.5kV in high vacuum mode. 
Zebrafish swimbladder mucosal infection model. Zebrafish infections were per- 
formed in accordance with NIH guidelines under Institutional Animal Care and 
Use Committee (IACUC) protocol A2009-11-01 at the University of Maine. To 
determine sample size, a power calculation was done for all experiments based on 
two-tailed t-tests in order to detect a minimum effect size of 0.8, with an alpha error 
probability of 0.05 and a power (1 — beta error probability) of 0.95. This gave a min- 
imum number of 42 fish for each group. The fish selected for the experiments were 
randomly assigned to the different groups by picking them from a pool without bias 
and the groups were injected in different orders. No blinding was used to read the 
results. Ten to twenty zebrafish per group per experiment were maintained at 33°C 
in E3+ PTU and used as previously described. Briefly, 4 days post-fertilization 
(dpf) larvae were treated with 20,.gml! dexamethasone dissolved in 0.1% DMSO 
1h before infection and thereafter. For tissue damage and neutrophil recruitment, 
individual AB or mpo:GFP fish (respectively) were injected into the swimbladder 
with 4nl of PBS with or without 25-40 C. albicans yeast cells of ece1A/A-dTomato, 
ece1 A/A + ECE1 + dTomato, ece1 A/A + ECE1 ,1g4-279 + dTomato or BWP17- 
dTomato. For tissue damage, 1 nl of Sytox green (0.05 mM in 1% DMSO) was 
injected at 20h post-infection into the swimbladder and fish were imaged by confo- 
cal microscopy at 24h post-infection. For neutrophil recruitment, fish were imaged 
at 24h post-injection. For synthetic peptide damage, AB or a-catenin:citrine fish 
were injected with 2 nl of peptide (9 ng or 1.25 ng per fish) or vehicle (40% DMSO 
or 5% DMSO) + SytoxGreen (0.05 mM in 1% DMSO) or SytoxOrange (0.5 mM 
in 10% DMSO) and the fish imaged by confocal microscopy 4h later. Numbers of 
neutrophils and damaged cells observed were counted and tabulated for each fish. 
Zebrafish swimbladder fluorescence microscopy. Live zebrafish imaging was car- 
ried out as previously described”. Briefly, fish were anaesthetized in Tris-buffered 
Tricaine (200 1g ml“!, Western Chemicals) and further immobilized in a solution 
of 0.4% low-melting-point agarose (LMA, Lonza) in E3 + Tricaine in a 96-well 
plate glass-bottom imaging dish (Greiner Bio-On). Confocal imaging was carried 
out using an Olympus [X-81 inverted microscope with an FV-1000 laser scanning 
confocal system (Olympus). Images were collected and processed using Fluoview 
(Olympus) and Photoshop (Adobe Systems). Panels are either a single slice for the 
differential interference contrast channel (DIC) with maximum projection over- 
lays of fluorescence image channels (red-green), or maximum projection overlays 
of fluorescence channels. The number of slices for each maximum projection is 
specified in the legend of individual figures. 

Murine oropharyngeal candidiasis model. Murine infections were performed 
under UK Home Office Project Licence PPL 70/7598 in dedicated animal facilities 
at King’s College London. No statistical method was used to pre-determine sample 
size. No method of randomization was used to allocate animals to experimental 
groups. Mice in the same cage were part of the same treatment. The investigators 
were not blinded during outcome assessment. A previously described murine 
model of oropharyngeal candidiasis using female BALB/c mice‘ was modified to 
use for investigating early infection events. Briefly, mice were treated subcutane- 
ously with 3 mg per mouse (in 20011 PBS with 0.5% Tween 80) of cortisone acetate 
on days —1 and +1 post-infection. On day 0, mice were sedated for ~75 min with 
an intra-peritoneal injection of 110 mg per kg ketamine and 8 mg per kg xylazine, 
and a swab soaked in a 10” cfu per ml of C. albicans yeast culture in sterile saline 
was placed sublingually for 75 min. After 2 days, mice were euthanized, the tongue 
excised and divided longitudinally in half. One half was weighed, homogenized 
and cultured to derive quantitative Candida counts. The other half was processed 
for histopathology and immunohistochemistry. 

Immunohistochemistry of murine tissue. C. albicans-infected murine tongues 
were fixed in 10% (v/v) formal-saline before being embedded and processed in 
paraffin wax using standard protocols. For each tongue, 5-|1m sections were pre- 
pared using a Leica RM2055 microtome and silane coated slides. Sections were 
dewaxed using xylene, before C. albicans and infiltrating inflammatory cells were 
visualized by staining using Periodic Acid-Schiff (PAS) stain and counterstaining 
with haematoxylin. Sections were then examined by light microscopy. Histological 
quantification of infection was undertaken by measuring the area of infected epi- 
thelium and expressed as a percentage relative to the entire epithelial area. 
Whole cell patch clamp. TR146 epithelial cells were grown in 35-mm Petri dishes 
(Nunc) for 48 h before recordings at low cell density (10-30% confluence). Cells 
were superfused with a modified Krebs solution (120 mM NaCl, 3mM KCI, 2.5mM 
CaCh, 1.2mM MgCh, 22.6mM NaHCO,, 11.1 mM glucose, 5mM HEPES pH 7.4). 
Isolated cells were recorded at room temperature (21-23 °C) in whole cell mode 
using microelectrodes (5-7 MQ) containing 90 mM potassium acetate, 20 mM 
KCl, 40mM HEPES, 3mM EGTA, 3mM MgCh, 1 mM CaCl, (free Ca” 40nM), 


pH 7.4. Cells were voltage clamped at —60 mV using an Axopatch 200A amplifier 
(Axon Instruments) and current/voltage curves were generated by 1s steps 
between —100 to +50 mV. Treatments were applied to the superfusate to produce 
the final required concentration, with vehicle controls similarly applied. Data was 
recorded using Clampex software (PClamp 6, Axon Instrument) and analysed with 
Clampfit 10. 

Calcium flux. TR146 cells were grown in a 96-well plate overnight until conflu- 
ent. The medium was removed and 50 ul of a Fura-2 solution (5 ul Fura-2 (Life 
Technologies) (2.5 mM in 50% Pluronic F-127 (Life Technologies):50% DMSO), 
5 ul probenecid (Sigma) in 5 ml saline solution (NaCl (140 mM), KCl (5 mM), 
MgCl (1mM), CaCl; (2mM), glucose (10 mM) and HEPES (10 mM), adjusted 
to pH 7.4)) was added and the plate incubated for 1h at 37°C/5% CO». The Fura-2 
solution was replaced with 5011 saline solution and baseline fluorescence read- 
ings (excitation 340 nm/emission 520 nm) taken for 10 min using a FlexStation 3 
(Molecular Devices). Ecel peptides were added at different concentrations and 
readings immediately taken for up to 3h. The data was analysed using Softmax 
Pro software to determine calcium present in the cell cytosol and expressed as the 
ratio between excitation and emission spectra. 

Impedance spectroscopy of tethered bilayer lipid membranes (tBLMs). tBLMs 
with 10% tethering lipids and 90% spacer lipids (T10 slides) were formed using the 
solvent exchange technique***” according to the manufacturer's instructions (SDx 
Tethered Membranes Pty Ltd, Sydney, Australia). Briefly, 8 jul of 3mM lipid solu- 
tions in ethanol were added, incubated for 2 min and then 93.4, buffer (100 mM 
KCl, 5mM HEPES, pH 7.0) was added. After rinsing 3x with 1001] buffer the 
conductance and capacitance of the membranes were measured for 20 min before 
injection of Ecel peptides at different concentrations. All experiments were per- 
formed at room temperature. Signals were measured using the tethaPod (SDx 
Tethered Membranes Pty, Sydney, Australia). 

FRET intercalation experiments. Intercalation of Ece1 peptides into phospho- 
lipid liposomes was determined by FRET spectroscopy applied as a probe-dilution 
assay**. Phospholipids mixed with each 1% (mol/mol) of the donor dye NBD- 
phosphatidylethanolamine (NBD-PE) and of the acceptor dye rhodamine-PE, 
were dissolved in chloroform, dried, solubilized in 1 ml buffer (100 mM KCl, 5mM 
HEPES, pH 7.0) by vortexing, sonicated with a titan tip (30 W, Branson sonifier, cell 
disruptor B15), and subjected to three cycles of heating to 60°C and cooling down 
to 4°C, each for 30 min. Lipid samples were stored at 4°C for at least 12h before 
use. Ecel peptide was added to liposomes and intercalation was monitored as the 
increase of the quotient between the donor fluorescence intensity Ip at 531 nm and 
the acceptor intensity I, at 593 nm (FRET signal) independent of time. 

Circular Dichroism spectroscopy. CD measurements were performed using a 
Jasco J-720 spectropolarimeter (Japan Spectroscopic Co., Japan), calibrated as 
described previously*’. CD spectra represent the average of four scans obtained 
by collecting data at 1 nm intervals with a bandwidth of 2nm. The measurements 
were performed in 100 mM KCl, 5mM HEPES, pH 7.0 at 25°C and 40°C ina 
1.0mm quartz cuvette. The Ece1-III concentration was 151M. 

Planar lipid bilayers. Planar lipid bilayers were prepared using the Montal-Mueller 
technique’ as described previously. All measurements were performed in 5mM 
HEPES, 100 mM KCl, pH 7.0 (specific electrical conductivity 17.2 mS per cm) at 
37°C. 

Hyphal secretome preparation for LC-MS/MS analysis. Candida strains were 
cultured for 18h in hyphae inducing conditions (YNB medium containing 2% 
sucrose, 75 mM MOPSO buffer pH 7.2, 5mM N-acetyl-p-glucosamine, 37 °C). 
Hyphal supernatants were collected by filtering through a 0.2\1m PES filter, and 
peptides were enriched by solid phase extraction (SPE) using first C4 and subse- 
quently C18 columns on the C4 flowthrough. After drying in a vacuum centrifuge, 
samples were resolubilized in loading solution (0.2% formic acid in 71:27:2 ACN/ 
H,0/DMSO (v/v/v)) and filtered through a 10 kDa MWCO filter. The filtrate was 
transferred into HPLC vials and injected into the LC-MS/MS system. LC-MS/ 
MS analysis was carried out on an Ultimate 3000 nano RSLC system coupled to a 
QExactive Plus mass spectrometer (ThermoFisher Scientific). Peptide separation 
was performed based on a direct injection setup without peptide trapping using 
an Accucore C4 column as stationary phase and a column oven temperature of 
50°C. The binary mobile phase consisting of A) 0.2% (v/v) formic acid in 95:5 
H,0/DMSO (v/v) and B) 0.2% (v/v) formic acid in 85:10:5 ACN/H;0/DMSO 
(v/v/v) was applied for a 60 min gradient elution: 0-1.5 min at 60% B, 35-45 min 
at 96% B, 45.1-60 min at 60% B. The Nanospray Flex Ion Source (ThermoFisher 
Scientific) provided with a stainless steel emitter was used to generate positively 
charged ions at 2.2kV spray voltage. Precursor ions were measured in full scan 
mode within a mass range of m/z 300-1600 at a resolution of 70k FWHM using 
a maximum injection time of 120 ms and an automatic gain control target of 1e6. 
For data-dependent acquisition, up to 10 most abundant precursor ions per scan 
cycle with an assigned charge state of z= 2-6 were selected in the quadrupole for 
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further fragmentation using an isolation width of m/z 2.0. Fragment ions were 
generated in the HCD cell at a normalized collision energy of 30 V using nitro- 
gen gas. Dynamic exclusion of precursor ions was set to 20s. Fragment ions were 
monitored at a resolution of 17.5k (FWHM) using a maximum injection time of 
120ms and an AGC target of 2e5. 

Protein database search. Thermo raw files were processed by the Proteome 
Discoverer (PD) software v1.4.0.288 (Thermo). Tandem mass spectra were 
searched against the Candida Genome Database (http://www.candidagenome. 
org/download/sequence/C_albicans_SC5314/Assembly22/current/C_albicans_ 
$C5314_A22_current_orf_trans_all.fasta.gz; status: 2015/05/03) using the Sequest 
HT search algorithm. Mass spectra were searched for both unspecific cleavages 
(no enzyme) and tryptic peptides with up to 4 missed cleavages. The precursor 
mass tolerance was set to 10 p.p.m. and the fragment mass tolerance to 0.02 Da. 
Target Decoy PSM Validator node and a reverse decoy database was used for 
(q value) validation of the peptide spectral matches (PSMs) using a strict target 
false discovery (FDR) rate of <1%. Furthermore, we used the score versus charge 
state function of the Sequest engine to filter out insignificant peptide hits (xcorr 
of 2.0 for z=2, 2.25 for z=3, 2.5 for z= 4, 2.75 for z=5, 3.0 for z=6). At least two 
unique peptides per protein were required for positive protein hits. 

Statistics. TransAM and patch clamp data were analysed using a paired t-test while 
cytokines, LDH and calcium influx data were analysed using one-way ANOVA 
with all compared groups passing an equal variance test. Murine in vivo data was 
analysed using the Mann-Whitney test. Zebrafish data was analysed using the 
Kruskal-Wallis test with Dunn’s multiple comparison correction. In all cases, 
P<0.05 was taken to be significant. 
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Extended Data Figure 1 | C. albicans ECE1 expression and phenotypic epithelial cells. d, No difference in adhesion of C. albicans wild-type, 


effects of ECE1 gene deletion. a, Relative expression (vs t=0) of ECE1 ece1A/A and ece1A/A + ECE1 strains to TR146 epithelial cells after 

in C. albicans wild type over time after addition of yeast cells to TR146 60 min. e, No difference in invasion of C. albicans wild-type, ece1A/A 
epithelial cells as measured by RT-qPCR. b, Imaging confirmation of and ece1A/A + ECE] strains into TR146 epithelial cells after 3h. 

ECE] expression over time within C. albicans wild-type strain. C. albicans f, Fluorescence staining of C. albicans wild-type and ece1 A/A hyphae 
cells expressing GFP under the control of the ECE] 5’ intragenic region, invading through TR146 epithelial cells. Fungal cells are stained with 
containing the ECE1 promoter, were grown on TR146 epithelial cells and calcofluor white (CFW, post-permeabilization) and Alexa-Fluor-647- 
stained with calcofluor white (CFW, post-permeabilization) to show cell labelled concanavalin A (ConA, pre-permeabilization) to show cell 

wall chitin and Alexa-Fluor-647-labelled concanavalin A (ConA, pre- wall chitin and carbohydrates, respectively, and to distinguish between 
permeabilization) to show carbohydrates. A composite image showing invading hyphae (only stained after permeabilization) and non-invading 
CFW, ConA, GFP and the brightfield (BF) image is shown. c, Scanning hyphae (stained both pre- and post-permeabilization). Levels of chitin and 
electron micrographs (top panels, 5h) and light microscopy (bottom 8-glucan are comparable in both strains. White arrows indicate invasion 
panels, 24h) showing no gross abnormalities in hypha formation between into epithelial cells. Data shown are representative (b, c, f) or the mean 
C. albicans wild-type (BWP17 + CIp30), ECE1-deletion (ece1A/A) and (a, d, e) of three biological replicates. Error bars show + s.e.m. 


ECE1 re-integrant (ece1 A/A + ECE1) strains after infection of TR146 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | C. albicans Ece1p is critical for mucosal 
virulence in vivo. a, Fungal burdens recovered from the tongues of 

mice infected with C. albicans wild-type (BWP17 + CIp30) (number 

of mice(n) = 13), ECE1-deletion (ecel A/A) (n= 20) and ECE1 re- 
integrant (ecel A/A + ECE1) (n= 24) strains after a 2-day oropharyngeal 
infection. b, Average percentage of the entire tongue epithelium area 
infected in different groups of mice infected with the different C. albicans 
strains. c, Confocal imaging of 4-day post-fertilization (dpf) mpo-gfp 
transgenic zebrafish swimbladders infected with C. albicans wild-type 
(BWP17 + CIp30 + dTomato), ECE1-deletion (ecel A/A + dTomato) 

and ECE] re-integrant (ece1 A/A + ECE1 + dTomato) strains for 24h. 

C. albicans cells appear red while neutrophils appear green. Red dots outline 
the swimbladder. Images are composites of maximum projections in the 
red and green channels (25 slices each, approximately 100 1m depth) with 
(left) or without (right) a single slice in the DIC channel overlay. Scale bars 
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represent 100m. d, Confocal imaging of 4 dpf zebrafish swimbladders 
infected with C. albicans wild-type (BWP17 + CIp30 + dTomato), 
ECE1-deletion (ecel A/A + dTomato) and ECE] re-integrant 

(ecelA/A + ECE1 + dTomato) strains for 24h stained with the fluorescent 
exclusion dye Sytox Green. C. albicans cells appear red and damaged 
epithelial cells appear green. White dots outline the pronephros and 

red dots outline the swimbladder. Images are composites of maximum 
projections in the red and green channels (25 slices each, approximately 
100m depth) with (left) or without (right) a single slice in the DIC 
channel overlay. High magnification images of the white boxes are shown. 
Scale bars (bottom right) represent 100 1m (low magnification) and 30 um 
(high magnification). Data shown are the mean (a, b) or representative 

(c, d) of at least three biological replicates. Error bars show +s.e.m. Data 
were analysed by Mann-Whitney U-test. ***P < 0.001. 
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a 


Candida albicans Ece1p amino acid sequence: 

MKFSKIACATVFALSSQAAT I HHAPEFNMKRDVA PAA PAAPADQAPTVPAPQEFNTAITKRSIIGIIMGILGNIPQVIQIIMSTVRKA 
FKGNKREDIDSVVAGI IADMPFVVRAVDTAMTSVAS TKRDGANDDVANAVVRLPE I VARVATGVOOS I ENAKRDGVPDVGLNLVANA 
PRLISNVFDGVSETVQQAKRDGLEDFLDELLORLPQLITRSAESALKDSQPVKRDAGSVALSNLIKKS IETVGIENAAQIVSERDIS 
SLIEEYFGKA 


Ecet1-l,.31 Ecet-lls261 Ecet-Illgz93 Ecet-IVoq.126 Ece1-Vy27-160 Ecet-Vii61-194 Ece1-Viligs-228 Ecet-VIllzz9.271 


Ecel-I,_3 MKF'SKIACATVFALSSQAAIIHHAPEFNMKR 

Ecel-IIs9 61 DVAPAAPAAPADQAPTVPAPQEFNTAITKR 

Ecel-II¢-93 SIIGIIMGILGNIPQVIQT IMSIVKAFKGNKR 
Ecel-IV9q_126 EDIDSVVAGI LADMPFVVRAVDTAMTS VASTKR 
Ecel-Vy37-160 DGANDDVANAVVRLPE IVARVATGVQQSIENAKR 
Ecel-V1y¢1-19, | DGVPDVGLNLVANAPRLISNVFDGVSETVQQAKR 
Ecel-VIIj55-223 | DGLEDFLDELLORLPOLITRSAESALKDSQPVKR 
Ece1l-VIIIj29-27; DAGSVALSNLIKKSIETVGIENAAQIVSERDISSLIEEYFGKA 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Ece1-III¢2-93 is the active region of Ecelp. 

a, Amino acid sequence of Ecelp and a schematic of the protein, indicating 
the signal peptide (SP), lysine-arginine motifs (KR) at the C terminus 

of each peptide, and the processed peptides (Ece1-I-VII) produced 

by Kex2p cleavage. b, Amino acid sequences of the processed peptides 
(Ece1-I-VIII) produced by Kex2p cleavage. c—e, Induction of GM-CSF (c), 
IL-1a (d) and IL-6 (e) secreted after stimulation of TR146 epithelial 

cells for 24h with varying concentrations of Ece1-III¢2-93 (1.5-70 1M). 

f, Phosphorylation of MKP-1 and c-Fos production after 2h treatment of 
TR146 epithelial cells with 15 |1M of Ece1-III¢2-35 (hydrophobic region), 
Ece1-IIg6-93 (hydrophillic region), Ece1-II1¢2-35 and Ece1-II1g¢_93 together, 
or Ece1-II¢._93 alone. g, Induction of G-CSF secretion after 24h treatment 
of TR146 epithelial cells with 151M of Ece1-II¢2-s5, Ece1-IIIge_93, 
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Ece1-HI¢2-g5 and Ece1-IIIgg_-93 together, or Ece1-III¢2-93 alone. h, Fold 
change induction of LDH release after 24h treatment of TR146 epithelial 
cells with 70 pM. of Ecel-III¢2-g5, Ecel -II[g¢.93, Ecel-II]¢2-85 and Ecel- 
IIIg¢-93 together, or Ece1-III¢2-93 alone. i, Induction of p-MKP-1 and c-Fos 
2h post-infection (p.i.) with the indicated C. albicans strains (MOI = 10). 
j, c-Fos DNA binding induction 3h p.i. with indicated C. albicans strains 
(MOI = 10). k, G-CSF secretion and 1, LDH release 24h p.i. with indicated 
C. albicans strains (MOI = 0.01). Data shown are representative (f, i) or the 
mean (c-e, g, h, j-l) of three biological replicates. Error bars show + s.e.m. 
Data were analysed by one-way ANOVA (c-e, g, h, k-I) or f-test (j). 
*P<0.05, **P< 0.01, ***P < 0.001 (compared with vehicle control). For 
gel image, see Supplementary Fig. 1. 
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Extended Data Figure 4 | Ece1-III¢2-93 is required for C. albicans 
mucosal infection. a, Fungal burdens recovered from the tongues of 
mice infected with C. albicans wild-type (BWP17 + CIp30) (number of 
mice (m) = 13), ECE1-deletion (ece1 A/A) (n=20), ECE1 re-integrant 
(ecelA/A + ECE1) (n= 24) and Ecel -III¢2_93 deletion (ecel A/A + 
ECE1,\ 84-279) (n= 10) strains after 2-day oropharyngeal infection. 

b, Average percentage of the entire tongue epithelium area infected in 
different groups of mice infected with the different C. albicans strains. 
c, Confocal imaging of 4 dpf zebrafish swimbladders infected with 

C. albicans Ece1-II¢2_93 deletion (ecel A/A + ECE1 4184-279 + dTomato) 


% of epithelia infected 


Increased 
Fluorescence magnification 


and ECE] re-integrant (ece] A/A + ECE1 + dTomato) strains for 24h 
stained with the fluorescent exclusion dye Sytox Green. C. albicans cells 
appear red and damaged cells appear green. White dots outline the 
pronephros and red dots outline the swimbladder. Images are composites 
of maximum projections in the red and green channels (25 slices each, 
approximately 100 1m depth) with (left) or without (right) a single slice in 
the DIC channel overlay. Scale bars (bottom right) represent 100 1m. Data 
shown are the mean (a, b) or representative (c), of at least three biological 
replicates. Error bars show + s.e.m. Data were analysed by Mann-Whitney 
U-test. **P < 0.01, ***P < 0.001. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Ece1-III¢2-93 is a cytolytic «-helical peptide. 
a, Circular dichroism spectra showing the a-helical conformation of 
Ece1-III¢2-93 in buffer (100 mM KCl, 5mM HEPES, pH 7). Increasing the 
temperature from 25°C to 40°C did not affect the stability of the a-helical 
structure. b, Diagram to illustrate the amphipathic nature of Ece1-II]¢)_93 
(residues 62-78, left panel; residues 79-93, right panel). Residues with 
hydrophobic or polar/charged side chains are displayed with a blue and 
white background, respectively. Modified from output generated in 


PEPWHEEL (http://emboss.bioinformatics.nl/cgi-bin/emboss/pepwheel). 


c, Forster resonance energy transfer (FRET) experiments show the 
intercalation of Ece1-III¢2_93 into lipid liposomes (101M) composed 
of DOPC in the absence or presence of cholesterol. Peptide titration of 


Ece1-III¢2-93 to liposomes showed slightly enhanced intercalation for 

pure DOPC. d, Ece1-III¢2-93 induced the permeabilization of planar lipid 
membranes composed of DOPC. The graph shows heterogeneous and 
transient lesions leading finally to a rupture of the membrane. Ece1-III¢2-93 
concentration was 0.125 1M. e, Induction of p-MKP-1 and c-Fos 2h in 
TR146 cells post stimulation (p.s.) with Ece1-I]¢2-93xr or Ece1-II]¢2-93aa- 
f, Secretion of G-CSF from TR146 cells 24h p.s. with Ece1-II]¢2-93xr or 
Ece1-III¢2-93,a. Data shown are representative (a—e) or mean (f) of at least 
three biological replicates. Error bars show + s.e.m. Data were analysed by 
one-way ANOVA (f). *P < 0.05, **P < 0.01, ***P < 0.001 (compared with 
vehicle control). For gel images, see Supplementary Fig. 1. 
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Early stage infection/Sub-lytic Ece1-II| concentration Late stage infection/Lytic Ece1-lIl concentration 


C. albicans C. albicans “tH 


Epithelial cell . Epithelial cell 


Immune cytokines (IL-6, G-CSF, GM-CSF) Immune and damage cytokines (IL-1) 
Extended Data Figure 6 | Schematic of the role of Ece1-III in membrane damage and the release of lactate dehydrogenase from the 
C. albicans infection of epithelial cells. a~h, During early stage infection host epithelium (e), concomitant with calcium influx (f). Epithelial signal 
of the mucosal surface by C. albicans, Ece1-III (red «-helix) is secreted transduction is maintained (g), and additionally induces the release of 
into the invasion pocket created by the invading hypha (a), Sub-lytic damage associated cytokines, such as IL-1a (h). Ece1-III may also have 
concentrations of Ece1-III trigger epithelial signal transduction through activity on the epithelial surface outside of the invasion pocket and on 


MAPK, p38/MKP-1 and c-Fos (b), resulting in the production of immune neighbouring cells not in contact with hyphae if Ece1-III is produced in 
regulatory cytokines (c), As the severity of the infection increases, Ecel-III sufficient concentrations. 
accumulates (d), and once lytic concentrations are reached, causes 
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Extended Data Table 1 | C. albicans strains used in this study 


Strain name Strain/Gene Function Strain Morphology* Phospho- c-Fos* Cytokines* Damage Phenotype 
Reference MKP1* Reference 
Controls 
$C5314 Wild type 52 Hyphae Yes Yes Yes Yes This study 
BWP17 & Clp30 Parental strain 53 Hyphae Yes Yes Yes Yes This study 
CAI-4 & Clp10 Parental strain 54 Hyphae Yes Yes Yes Yes This study 
CAF2-1 Parental strain 54) Hyphae Yes Yes Yes Yes This study 
DAY286 Parental strain 55. Hyphae Yes Yes Yes Yes This study 
Yeast-locked 
efg1A/A Transcription factor 56 Yeast No No No No This study 
efgi/cph1A/A Transcription factor/ Transcription factor 57 Yeast No No No No [58]/This study 
eed1A/A RNA polymerase |! regulator 59) Yeast No No No No [58]/This study 
flo84/a Transcription factor (60) Yeast No No No No This study 
tpk14/A cAMP-dependent protein kinase [61 Yeast No No No No This study 
tpk2A/A cAMP-dependent protein kinase (62 Yeast No No No No This study 
vps11A/A Protein trafficking 63 Yeast No No No No This study 
cap1A/A Transcription factor [64 Yeast No No Yes No This study 
och1A/A Alpha-1,6-mannosyltransferase 65. Yeast No No No No [66] 
kex2A/A Processing enzyme 67 Yeast No No No No This study 
Hypha-producing 
nrg1A/a Transcriptional corepressor (68) Hyphae Yes Yes Yes Yes [58]/This study 
cph1a/a Transcription factor [69 Hyphae Yes Yes Yes Yes This study 
cph2A/A Transcription factor 70 Hyphae Yes Yes Yes Yes This study 
efh1a/A Transcription factor (71. Hyphae Yes Yes Yes Yes This study 
czfA/A Transcription factor 72. Hyphae Yes Yes Yes Yes This study 
1fg1A/a Transcriptional repressor (73. Hyphae Yes Yes Yes Yes This study 
hog1A/A MAP kinase 74) Hyphae Yes Yes Yes Yes This study 
sun42A/A Adhesin-like protein (75. Hyphae Yes Yes Yes Yes This study 
pga29A/A GPl-anchored yeast-associated protein (76 Hyphae Yes Yes Yes Yes This study 
phr2h/a Glycosidase rags Hyphae Yes Yes Yes Yes This study 
pga36/A GPl-anchored protein 78) Hyphae Yes Yes Yes Yes This study 
ece1A/A Hypha-associated protein [79]ll Hyphae No No No No This study 
mkc1A/A MAP Kinase (80) Hyphae Yes Yes Yes Yes This study 
bud2A/A GTPase activating protein 81 Hyphae Yes Yes Yes Yes This study 
praiA/A Zinc binding protein 82 Hyphae Yes Yes Yes Yes This study 
utr2/crh11/crh12h/A Putative wall [83 Hyphae Yes Yes Yes Yes This study 
glycosidase/transglycosylase 
wap1A/A Surface antigen on hyphae/buds EZ) Hyphae Yes Yes Yes Yes This study 
sod5A/A Superoxide dismutase 85. Hyphae Yes Yes Yes Yes This study 
hwp1A/A Adhesin [86 Hyphae Yes Yes Yes Yes This study 
rbt1A/A Putative GPl-modified cell wall protein (84) Hyphae Yes Yes Yes Yes This study 
rbtSA/A Heme binding (84) Hyphae Yes Yes Yes Yes This study 
hyr1A/A GPl-anchored hyphal cell wall protein 87 Hyphae Yes Yes Yes Yes This study 
mp65A/A Cell surface mannoprotein (88) Hyphae Yes Yes yes Yes This study 
cek1A/A ERK-family protein kinase (89) Hyphae Yes Yes Yes Yes This study 
sap2A/A Secreted aspartyl protease 90) Hyphae Yes Yes Yes Yes This study 
sap7A/A Secreted aspartyl protease 91 Hyphae Yes Yes Yes Yes This study 
sap9/sap10A/A Secreted aspartyl proteases 92 Hyphae Yes Yes Yes Yes This study 
als1A/A Agglutinin-like sequence protein 93 Hyphae Yes Yes Yes Yes 94) 
als20/ Pua-ALS2 Agglutinin-like sequence protein 95, Hyphae Yes Yes Yes Yes 94) 
als3A/A Adhesin 93 Hyphae Yes Yes Partial Partial" 94) 
als4A/A Agglutinin-like sequence protein 95, Hyphae Yes Yes Yes Yes 94) 
als5A/A Agglutinin-like sequence protein 96 Hyphae Yes Yes Yes Yes 94) 
als6A/A Agglutinin-like sequence protein 96 Hyphae Yes Yes Yes Yes 94) 
als7A/A Agglutinin-like sequence protein 96 Hyphae Yes Yes Yes Yes 94) 
als9A/A Agglutinin-like sequence protein 97) Hyphae Yes Yes Yes Yes 94) 
pmt1a/a Mannosyltransferase (98 Hyphae Partial" Partial" Partial" Partial 66) 
pmr1A/A Secretory pathway ATPase 99) Hyphae Partial" Partial" Partial" Partial" 66) 
mnn4a/a Regulator of mannosylphosphorylation 100 Hyphae Yes Yes Yes Yes 66) 
mnn9A/A Putative mannosyltransferase 101 Hyphae Yes Yes Yes Yes 66) 
mnt1/mnt20/4 Mannosyltransferases 102 Hyphae Yes Yes Yes Yes 66) 
chs2/chs3A/A Chitin synthase/ Chitin synthase 103 Hyphae Yes Yes Yes Yes This study 
mit1A/A Mannose:Inositolphosphoceramide 104 Hyphae Yes Yes Yes Yes 66) 
mannose transferase 
bmt1A/A Beta-mannosyltransferase 105 Hyphae Yes Yes Yes Yes 66) 
bmt2A/4 Putative beta-mannosyltransferase 105 Hyphae Yes Yes Yes Yes 66) 
bmt3A/h Beta-mannosyltransferase 105 Hyphae Yes Yes Yes Yes 66) 
bmt4A/O Beta-mannosyltransferase 105 Hyphae Yes Yes Yes Yes 66) 
bmtSA/A Putative beta-mannosyltransferase 106 Hyphae Yes Yes Yes Yes 66) 
bmt6A/4 Beta-mannosyltransferase 106 Hyphae Yes Yes Yes Yes 66) 
gsc1A/GSC1 Beta-1,3-glucan synthase catalytic 107 Hyphae Yes Yes Yes Yes 66) 
subunit 
gsl1A/A Beta-1,3-glucan synthase subunit 107 Hyphae Yes Yes Yes Yes 66) 
gsl2A/A Beta-1,3-glucan synthase subunit 107 Hyphae Yes Yes Yes Yes 66) 
kre6A/KREG Beta-1,6-glucan synthase subunit 108 Hyphae Yes Yes Yes Yes This study 


References 52-108 are cited in this table. 

*Morphology recorded 2h post-infection on TR146 buccal cell monolayers; hyphae includes pseudohyphae. 

}+Data based on western blotting. 

£Cytokines include IL-1a, IL-6 and G-CSF. 

§Damage measured by LDH assay. 

|| New ece1 A/A also created in this study (see Extended Data Table 2). Original mutant (in red) produced by ref. 27 using the URA-blaster protocol’. A set of ecel mutants, including partial deletion of 
ECE1 and a revertant, was produced in this study in the same genetic background using strain BWP17 to avoid a URA3 effect based on genomic location !09119, 

qPartial activation is due to lack of adhesion. 
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Extended Data Table 2 | C. albicans mutant strains constructed and used in this study 


Strain description 


Strain name 


Genotype 


BWP17+Clp30 


ece1A/A 


ece1A/A+ECE1 


ece1A/AtECE1y 194-279 


kex1A/A 


SC5314+pECE1-GFP (ECE1 promoter-GFP) 
BWP17+Clp30+pENO1-dTom (ENO1 promoter-dTom) 


ece1A/A+pENO1-dTom (ENO1 promoter-dTom) 


ece1A/A+ECE1 + dTomato 


ece1A/AtECE1) 194.279 + dTomato 


M1477 


M2057 


M2059 


M2174 


M2258 


CA58 
RWC83 


RWC84 


RWC85 


RWC86 


ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
RPS1/rps1::(URA3-HIS1-ARG4) 
ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
ece1::HIS1/ece1::ARG4 
RPS1/rps1::URA3 
ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
ece1::HIS1/ece1::ARG4 
RPS1/rps1::(URA3-ECE1) 
ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
ece1::HIS1/ece1::ARG4 
RPS1/rps1::(URA3-ECE1*?”) 
ura3::Aimm434/ura3: :Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
kex1::HIS1/kex1::ARG4 
RPS1/rps1::URA3 
ECE1/ece1::GFP-SAT1 
ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
RPS1/rps1::(URA3-HIS1-ARG4) 
ENO1/eno1::dTom-SAT1 
ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
ece1::HIS1/ece1::ARG4 
RPS1/rps1::URA3 
ENO1/eno1::dTom-SAT1 
ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4::hisG 
ece1::HIS1/ece1::ARG4 
RPS1/rps1::(URA3-ECE1) 
ENO1/eno1::dTomato-NAT 
ura3::Aimm434/ura3::Aimm434 
iro1::Aimm434/iro1::Aimm434 
his1::hisG/his1::hisG 
arg4::hisG/arg4: :hisG 
ece1::HIS1/ece1::ARG4 
RPS1/rps1::(URA3-ECE1*?”) 
ENO1/eno1::dTomato-NAT™ 
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Extended Data Table 3 | LC-MS/MS analysis of C. albicans Ece1-Ill 


Wild Type ece1A/A+ECE1 TR146+Wild type rEce1p+rKex2p kex14/4 


SIIGIIMGILGNIPQVIQIIMSIVKAFKGNK 699 (86%) (41%) 477 (89%) (35%) 79 (97.5%) (97.5%) nid® 49 (13.3%) (3.6%) 


1 (0.1%) (0.06%) 1 (0.2%) (0.07%) 2 (2.5%) (2.5%) 248 (80%) (1.5%) 291 (78.9%) (21%) 


*The number of peptide spectrum matches. Data for ece1 A/A and ece1 A/A + ECE1 aiga-279 are not included as no Ece1-IIl peptides were detected in either strain, as expected. 
tPercentage of SIIGIIMGILGNIPQVIQIIMSIVKAFKGNK or SIIGIIMGILGNIPQVIQIIMSIVKAFKGNKR detected amongst all Ece1-IIl peptides found by LC-MS/MS. 

+Percentage of SIIGIIMGILGNIPQVIQIIMSIVKAFKGNK or SIIGIIMGILGNIPQVIQIIMSIVKAFKGNKR detected amongst all Ecelp peptides found by LC-MS/MS. 

§n/d; not detected. 
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Extended Data Table 4 | Oligonucleotide primers used in this study 


Primer name 


ECE1-FG 


ECE1-RG 


ECE1-RecF3k 
ECE1-RecR 
ECE1-F1 
ECE1-R1 


KEX1-FG 


KEX1-RG 


KEX1-F1 
KEX1-R1 
HIS-F2 
HIS-R2 
ARG-F2 
ARG-R2 
ECE1-Fint1 
URAF2 
RPF-1 

RPF-2 
Pep3-F1 
Pep3-R1 
5'ECE1prom- 
Narl 
3'ECE1prom- 
Xhol 
S'ECE1term- 
Sacll 
S'ECE1term- 


Sacl 
G1-ECE1 


GFP veri rev 
G4-ECE1 
X3-SAT1 
pENO1 FW 
dTom REV 
ACT1-F 
ACT1-R 
ECE1-F 


ECE1-R 


Application 


PCR 


PCR 


PCR 
PCR 
PCR 
PCR 


PCR 


PCR 


PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
PCR 
qPCR 
qPCR 
qPCR 


qPCR 


Sequence (5’-3’) 


atcaaataacccacctatttcaaaattgttttatttttgtttatctctacaaca 
aacaactttcctttattttactaccaactattttccattcgttaaagaagcttc 
gtacgctgcaggtc 
cacaaaaaacaacaattaaaaaaatcagttacagcaaaagtgtcacaag 
acttatggaataaaagattaagcttgtggaaaacaaatttttatctgctgag 
cattctgatatcatcgatgaattcgag 


gcacgcgtctaaagtggagtaacaac 

ggtcgaccccagacgttggttgc 

ggcttctcataaatgaagggctcag 

gccgaatcaatcttgtegtgccac 
tatctttttttgttttatttacccatccttcatatctttacaacccttgatacctt 
acctaaacaacacacatctatctttaatcaatcaaacacaaatcaattgaa 
gcttcgtacgctgcaggtc 
tcacaatctagattattgtaggttgtatagacaaaaaataaaaatcaaact 
attattcgttatataaatctacaagatctctaatctccactgtaccgaaaaat 
tctgatatcatcgatgaattcgag 

ggaagcccataagaaattgga 

aggaagctgtegtggtagts 
ggacgaattgaagaaagctggtgcaaccg 
caacgaaatggcctcccctaccacag 

ggatatgttggctactgatttag 

aatggatcagtggcaccggtg 

ctaacgtttttgatggcgtcctgg 
ggagttggattagatgataaagetgatgg 
gagcagtgtacacacacacatcttg 

cgccaaagagtttcccctattate 

gaagatatcgattctgttgttgctgg 
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tcagctcgagtttaacgaatggaaaatagttgstag 
gatcccgcggcagcagataaaaatttgttttccacaag 
tcaggagctccgttaagaatatgaatgacagttggtc 
ctegctgattagagttcaagagt 

tgatctgggtatctcgcaaagcat 

tggaagattcacttgagttggaac 

gtgaagtgtgaagggggag 

tecttggctggcactgaactcg 

aaggtctaccttcaccttcacc 

tcagaccagctgatttaggtttg 

gtgaacaatggatggaccag 

atcgaaaatgccaagagag 


agcattttcaataccgacag 
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Recent near-Earth supernovae probed by global 
deposition of interstellar radioactive °°Fe 


A. Wallner', J. Feige*+, N. Kinoshita’, M. Paul‘, L. K. Fifield!, R. Golser?, M. Honda®, U. Linnemann’, H. Matsuzaki’, S. Merchel’, 


G. Rugel®, S. G. Tims!, P. Steier?, T. Yamagata’ & S. R. Winkler? 


The rate of supernovae in our local Galactic neighbourhood within a 
distance of about 100 parsecs from Earth is estimated to be one every 
2-4 million years, based on the total rate in the Milky Way (2.00.7 per 
century”). Recent massive-star and supernova activity in Earth’s 
vicinity may be traced by radionuclides with half-lives of up to 
100 million years, if trapped in interstellar dust grains that penetrate 
the Solar System. One such radionuclide is “Fe (with a half-life of 
2.6 million years)”*, which is ejected in supernova explosions and 
winds from massive stars”. Here we report that the “Fe signal 
observed previously in deep-sea crusts!" is global, extended in time 
and of interstellar origin from multiple events. We analysed deep-sea 
archives from all major oceans for “Fe deposition via the accretion of 
interstellar dust particles. Our results reveal “Fe interstellar influxes 
onto Earth at 1.5-3.2 million years ago and at 6.5-8.7 million years ago. 
The signal measured implies that a few per cent of fresh Fe was 
captured in dust and deposited on Earth. Our findings indicate 
multiple supernova and massive-star events during the last ten million 
years at distances of up to 100 parsecs. 

The density and temperature distribution of the interstellar medium 
(ISM) is highly variable, with typical substructures of about 50-150 pc 
(superbubbles) having lifetimes of some ten million years (Myr). Several 
supernova explosions over the last 14 Myr or so shaped the present 
structure of the local superbubble (the Local Bubble)!?“. The Solar 
System, now embedded in the Local Bubble, is expected to have faced 
fronts of supernova ejecta and accumulated material from massive 


stars. To enter the Solar System, any material from the ISM must be 
condensed into larger dust grains to avoid being deflected away by the 
solar wind and interplanetary magnetic field*!°''. ISM dust particles 
were indeed identified at Earth orbit'® and may accumulate on Earth in 
archives such as deep-sea sediments and ferromanganese (FeMn) crusts 
and nodules, which retain time information over millions of years. 60Re 
as well as *°Al (half-life t)/2=0.71 Myr) have been observed!” in the 
ISM as a result of many supernovae and emission from massive stars. 
Direct detection of ‘live’ radionuclides? *!!! on Earth would provide 
insight into recent and nearby nucleosynthesis in massive stars‘*!°"7, 
dust formation and transport into the Solar System. Extraterrestrial ®’Fe 
has in fact already been observed in FeMn crusts in pioneering studies 
at TU Munich’! and is interpreted as being of supernova’™!"!8 or 
(micro) meteoritic origin!?”°, 

Here, we determined the °’Fe contents of three different deep- 
sea archives (four sediment cores, two FeMn crusts and two FeMn 
nodules) recovered from the Indian, Pacific and Atlantic oceans 
respectively (Supplementary Fig. 1). All were dated via their '°Be 
(half-life t,)7 = 1.39 Myr) content, complemented by 7°Al for the 
sediments”!. All radionuclides (Fe, *°Al and !°Be) were counted using 
accelerator mass spectrometry (AMS) (Supplementary Information). 
The sediment cores provided a record from 1.7-3.2 Myr ago with a 
time resolution of <30,000 years (30 kyr), bracketed by recent and 
~5-7-Myr-old samples. The Pacific ‘Crust-1’ extends from the present 
to 10.9 Myr ago with ~2.2-Myr time resolution and ‘Crust-2’ extends 


Table 1 | Averaged ©°Fe/Fe atom ratios from AMS measurements at ANU of the sediment samples 


Background- and 


Time 50Fe 50Fe/Fe decay-corrected  Feconcentration Feconcentration ©Fedeposition Fe deposition 
Sediment period counts (10-15 atoms 60Fe/Fe (10-15 (10-2 grams (104 atoms rates (atoms (10° atomscm~2 
Sediment cores samples (Myr) detected per atom) atoms per atom) per gram) per gram) em yr-4) per layer) 
45-21/50-02 5 <0.2 2 0.06+0.04 0.02+0.02 0.30+0.10 <0.2 <0.2 NA 
49-53/45-21 14 1.71-2.0 123 1.67+0.15 2.52+0.23 0.23+0.01 6.0+0.6 22.8423 6.5+0.7 
49-53/45-21/50-02 11 2.0-2.3 51 1.51+40.21 2.48+0.35 0.24+0.01 6.7+1.0 24.843.6 7TA+#1.1 
49-53/45-21/50-02 vi 2.3-2.6 33 1.964034  3.50+0.61 0.17+0.01 6.5+1.2 27.145.0 8141.5 
49-53/45-16 7 2.6-2.9 54 3.40+0.46 6.6140.90 0.16+0.01 10.341.5 34.845.2 10.4+£1.5 
49-53/45-16 6 2.9-3.18 27 1.18+40.23 2.414047 0.13+0.01 3.440.7 114424 3.0+0.6 
45-16 2 ~4-7" 1 0.114011 0.20+0.30 0.14+0.01 <0.4 <l NA 
Commercial iron 99 Background 7 0.042+0.015 NA NA NA NA NA 
52 sediment samples from four sediment cores (Eltanin) from the Indian Ocean were analysed (individual data are listed in the Supplementary Information) as well as a series of blank samples (com- 
mercial iron). For the measured ©°Fe/Fe ratios, the uncertainties from ®°Fe denote statistical uncertainties only (1c, using Poisson statistics). For background- and decay-corrected °°Fe/Fe data, surface 
and old layers are compatible with the measurement background obtained from chemistry blank samples. The age is based on the 2°Al and !°Be data and tie-points of magnetic reversals. 
The stable iron content of the leached material was measured via inductively coupled plasma mass spectrometry (ICP-MS) (averaged values). The average leachable Fe content for the four cores was 


he Fe deposition equals the terrestrial fluence. 
*Uncertain by ~1 Myr. 
A, not applicable. 


measured to 0.18% (core Eltanin 49-53), 0.25% (45-16), 0.3% (45-21) and 0.45% (50-02). The mean dry density of the sediments was 1.16 gcm~®. For sediments with 100% incorporation efficiency, 
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Table 2 | ©°Fe/Fe ratios from AMS measurements at ANU of the Pacific and Atlantic FeMn samples 


Crust-1 


Background- and 


60Fe 50Fe/Fe decay-corrected Feconcentration ©°Feconcentration ©Feincorporation Fe incorporation 
Depth Time period counts (10-15 atoms 60Fe/Fe (10-15 atoms (10-2 grams (10° atoms per rates (atoms (10° atoms cm-2 
Layer (mm) (Myr) detected per atom) per atom) per gram) gram) cm~? yr!) per layer) 
Layer 1 0-5 0-2.17 23 0.96+0.25 1.19+0.33 12.8+0.1 1.64+0.45 0.72+0.20 1.56+0.43 
Layer 2 5-10 2.17-4.35 74 1.58+0.23 3.60+0.55 11.8+0.1 4.56+0.68 2.00+0.30 4.34+0.65 
Layer 3 10-15 4.35-6.52 2 0.14+0.12 0.36+0.51 11.4+40.1 0.34+40.62 0.19+0.27 0.40+0.59 
Layer 4 15-20 6.52-8.70 26 0.38+0.09 2.49+0.97 13.740.2 3.67+1.43 1.61+0.63 3.49+1.36 
Layer 5 20-25 8.70-10.87 3 0.15+0.11 1.20+1.50 11.2+40.2 1.45+1.82 0.64 40.80 1.38+41.73 
Total for O-4.35 Myr and 6.52-8.70 Myr 123 A NA A A 9.4+40.9 
Crust-2 
Layers 1,2 0-1.0 1.20-1.41 5 1.06+0.48 1.50+0.67 11.6+0.1 1.87 +0.83 1.67+40.75 0.36+0.16 
Layers 3-5 1.0-2.5 1.41-1.73 37 1.28+0.21 2.0140.33 11.9+40.1 2.60+0.43 2.33 £0.38 0.74+40.12 
Layers 6-8 2.5-4.0 1.73-2.05 32 0.84+0.15 1.39+40.25 11.8+0.1 1.73+0.31 1.55+0.27 0.50+0.09 
Layers 9, 10 4.0-5.8  2.05-3.05 20 0.55+0.12 1.09 40.24 11.140.1 1.35+0.30 0.46+0.10 0.46+0.10 
Layers11,12 587.7 3.05-4.11 2 0.12+0.08 0.29+0.21 10.4+0.1 0.37+0.26 0.13 40.09 0.13 40.09 
Layers 13-26 7.7-21.0 4.11-7 1 <0.03 <0.1 9 <0.1 <0.09 <0.06 
Total for 1.20-3.05 Myr 94 A NA NA A NA 2.19+0.22 
Nodule 21 
Layer 1 0-3 0-1.8 3 0.16+0.11 <0.23 15+ <0.4 <0.13 <0.23 
Layer 2 3-6 1.8-3.3 13 0.47+0.16 0.60+0.22 1542 0.97 40.36 0.40+0.15 0.55+0.21 
Layers 3, 4 6-17 3.3-5.4 5: 0.18+0.06 <0.10 1542 <0.16 <0.1 <0.23 
Total for 1.8-3.3 Myr 13 0.47+0.16 0.60+0.22 1542 0.97 40.36 0.40+0.15 0.55+0.21 
Nodule 24 
Layer 1 0-4 0-1.9 15 0.56+0.18 0.71 40.23 1542 1.1340.37 0.34+0.11 0.86+0.41 
Layer 2 4-8 1.9-3.3 5 0.2340.12 0.45 40.23 15+ 0.71+40.37 0.27+0.15 0.54+0.34 
Layers 3,4 8-19 3.3-5.4 1 0.05 +0.05 <0.15 1542 <0.24 <0.1 <0.25 
Total for O-3.3 Myr 20 0.40+0.10 0.58+0.13 1542 0.92+0.21 0.31+40.09 1.40+0.50 
The table shows data for layered samples of the two Pacific FeMn crust samples (Crust-1 and Crust-2) and of the two Atlantic FeMn nodules (numbers 21 and 24). For measured ®°Fe/Fe ratios, 
the uncertainties from ©°Fe denote statistical uncertainties only (10, using Poisson statistics). For background- and decay-corrected ©°Fe/Fe data, the machine background of ®°Fe/Fe=(0.042 +0.015) x 10-15 
was used for the subsequent background correction. Ages are based on the !°Be data and have an uncertainty of + 0.3 Myr (Crust-1) and 40.5 Myr (Crust-2 and nodules). Stable iron content of the 
dissolved crust material and of the leachate in case of the nodules was measured via ICP-MS. The mean dry density of the crust and nodule samples was 1.9 gcm~*. The Fe concentration listed for 


layers 13-26 of Crust-2 is the average; individual data range between 5.5% and 13.1%. ‘Total’ gives the sum of ©°Fe counts detected and the total ©°Fe incorporation of all layers that are significantly 


above the measurement background. 


from 1.2-7 Myr ago (~100-kyr resolution). Two nodules covered a time 
period from the present to 5.4 Myr ago (~2-Myr resolution). 

In the sediment, 288 °’Fe-events were registered for the time period 
1.71-3.18 Myr ago (45 individual samples) with a mean isotopic ratio of 
0Fe/Fe = (1.79 £0.10) x 107}, a factor of ~40 above the measurement 
background of (0.042 + 0.015) x 10~!° (Table 1). None of the recent or 
old sediment samples show evidence for ®°Fe above background (three 
$°Fe-events). The first two layers in Crust-1 gave “Fe-signals 40 and 
7a above background; layers 3 and 5 are close to the measurement 
background, but layer 4, which spans the period 6.5-8.7 Myr ago, has 
a higher ratio (~40 above background, Table 2). For Crust-2 a clear 
®Fe-signal was also found at <3.5 Myr. The nodules support this find- 
ing (Table 3, Supplementary Tables 3-5). 

In summary, two clear “Fe signals with a total of 538 © Fe events were 
observed. In the sediments, the signal covers the time period 1.7-3.2 Myr. 
In the two crusts, “Fe is found up to 3.5 Myr ago and up to about 4 Myr 
ago, respectively, with a second influx between 6.5 Myr and 8.7 Myr ago. 
The nodules confirm the presence of Fe at <3.3 Myr ago. No ®°Fe signal 
is found in recent (<0.2 Myr old) or older (>5 Myr old) sediments and 
nodules, or in crust layers 4.4-6.5 Myr and 8.7-10.9 Myr old. 

Between 1.7 Myr ago and 3.18 Myr ago, the ®’Fe deposition rate into 
the sediments was ~11-35 °°Fe atoms-cm~ yr~! (300-kyr averages), 
whereas incorporation rates into crust material were much lower at 1-2 
atoms cm” yr~! (Fig. 1; all data are decay-corrected). This suggests 
an incorporation efficiency into Crust-1 and Crust-2 of 17% and 7%, 
respectively. The deposition in the 1.5-Myr interval covered by the signal 
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in the sediment is (35 +2) x 10° atoms cm~’. For the second ®°Fe 
signal (6.5-8.7 Myr ago, Crust-1, 17% incorporation efficiency) it is 
(21+6) x 10° atoms cm~? (Tables 2 and 3). 

Although the 1.5-Myr time spread of ®°Fe influx measured in the 
present work exceeds the ~0.8 Myr previously reported for crust 
237KD!"!8 the two time profiles are not inconsistent given the 
lower counting statistics and signal-to-background ratio in ref. 11. 
Furthermore, the marginally positive result for the same time period 
for an Atlantic sediment'® is consistent with our data, considering their 
higher sedimentation rates and stable Fe contents. Fe has also been 
reported in lunar material, though without time information, and 
recently in Pacific sediments”*?". 

Clearly, our data are incompatible with a constant °’Fe production 
or deposition. A terrestrial origin can be ruled out, because there is no 
suitable target for cosmic-ray-induced production and anthropogenic 
input would be concentrated in the surface layer. Since ®Fe was found 
in each of the major oceans, it is reasonable to assume a uniform global 
distribution. A micro-meteoritic or meteoritic origin can be excluded, 
given that the measured cosmic-dust flux is 400 times lower than would 
be required (Supplementary Information and Supplementary Fig. 6). 
Similarly, a hypothetical break-up of a single object, comparable to the 
asteroid invoked in relation to the Cretaceous—Tertiary event 65 Myr 
ago, would have delivered 4,500 times less Fe. 

We assume that the extraterrestrial ®°Fe flux through Earth’s 
cross-section is homogenously distributed over Earth's surface. Thus, 
the measured mean deposition of ~24.5 atoms cm~* yr7! (1.7-3.2 Myr 
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Table 3 | Summary of ©°Fe deposition at various locations 


Deep-sea Time period © Fedetector  ©°Fe deposition 
archive Cores Location (Myr) events (10° atoms cm~2) 
Sediment 4 Indian = 1.71-3.18 288 35.4426 
Ocean 
FeMn Crust-1 2 Pacific 0-4.35 97 5.9+08 
Ocean 
FeMn Crust-1 6.52-8.70 26 3.5414 
FeMn Crust-2 1.2-3.1 94 2.2+0.2 
FeMn nodules 2 Atlantic 1.8-3.3 i3 0.6+0.2 
Ocean 
0-3.3 20 1440.5 
FeMn Mona 1 Pacific 0-5.9 21 ~orelt 
Pihoa!® Ocean 
FeMn 1 Pacific 1.74- 69 1.5+0.4* 
237KD1! Ocean 2.61* 
Lunar 4 Moon Integral - ~10 
material? 


Data were obtained in this work and as given in the literature!®!!,2? (no correction for incorpo- 
ration efficiency). Uncertainties are 1c. For Crust-1 and Crust-2 an incorporation efficiency of 
17% and 7%, respectively, has to be taken into account to calculate the Fe fluence from the 
deposition values; similarly 2% and 4% for the nodules. 

*Adjusted for revised ©°Fe and !°Be half-life values”®. 

TNot listed in ref. 22. 

+Background-corrected, adjusted for revised ©°Fe half-life and interpolation between the two layers. 


signal) corresponds to a ®Fe flux of 98 atoms cm~? yr~! into the 
inner Solar System or integrated over 1.5 Myr to an °°Fe fluence of 
(1.46 +£0.15) x 10° atoms cm~? at Earth orbit; the fluence for the older 
event is (1.2 +0.4) x 10° atoms cm~*. Interstellar grains, filtered by the 
Solar System in size to an average of ~0.5 1m, were detected by space 
missions!, suggesting that (6 +3)% of the mass of ISM dust reaches 
the inner Solar System®. These grains follow the flow velocity of the ISM. 

Assuming that the ®’Fe-loaded grains follow the same mass dis- 
tribution as determined for ISM grains at Earth orbit, we deduce 
an interstellar ©°Fe concentration in dust of (2.8 + 1.4) x 107!! Be 
atoms cm~? for 1.7-3.2 Myr and, integrated over the full period of 
11 Myr, an average concentration of ~(5-15) x 107~'? atoms cm~?. 
Observations of °°Fe decay’? and nucleosynthesis models” suggest 
an average Galaxy concentration of ~6 x 107'* Fe atoms cm™? 
(Supplementary Information), in agreement with the 11-Myr period 
of local data reported here. 

6°Fe is produced in massive stars””*-”” in their late phases, predom- 
inantly just before supernova explosions, and then ejected into space. 
(Super)asymptotic-giant-branch stars also produce and eject °°Fe 
through their stellar winds for a period of about 50 kyr, leading to a 
time profile similar to supernovae; however, their contribution to the 
Galactic ®’Fe inventory is small*®. 

Models suggest a travel time of about 200 kyr with a time spread of 
approximately 100-400 kyr (ref. 5) for ejecta from a single supernova 
at a distance from Earth of about 100 pc. Our measured spread of 
about 1.5 Myr is inconsistent with the interpretation in terms of ejecta 
from a single supernova (or asymptotic-giant-branch star) moving 
across the Solar System (Supplementary Fig. 6). It suggests multiple 
supernova and massive-star events within the last ten million years 
or so in Earth’s vicinity and during two distinct periods 1.7-3.2 Myr 
ago and around 6.5-8.7 Myr ago. The recent time profile would be 
compatible with movement across the Solar System of ejecta in a series 
of supernova fronts in short succession within 1.5 Myr. 

This would, however, require a high frequency of supernovae 
(around two to three supernovae per million years), since large fluc- 
tuations were not observed in the time profile. Alternatively, the 
ejecta containing the ®Fe-bearing grains could have come to rest in 
the ambient ISM and diffused into volumes or clouds that were then 
traversed by the Solar System!*. 

The Solar System is currently embedded in a flow of ISM material 
with interstellar grains moving parallel to the flow of neutral interstellar 
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Figure 1 | Deposition rates for sediment (150-kyr averaged data) and 
incorporation rates for two crust samples. °’Fe concentrations (*°Fe 
per gram) for the sediment are given in the inset; they were on average 
6.7 x 10* atoms per gram between 1.7 Myr and 3.2 Myr, but 260 x 104 
atoms per gram of crust and 95 x 104 atoms per gram of nodule, 
reflecting the difference in growth rate and incorporation efficiency 
(see Supplementary Information). The error bars (1o Poisson statistics) 
include all uncertainties and scale with decay correction, so that 
uncertainties and upper limits become larger for older samples. The 
absolute ages for the sediment samples have an uncertainty of 0.1 Myr, 
except for the 5.5-Myr-old sediments, which have an uncertainty of about 
1 Myr. The age of Crust-1 has an uncertainty of 0.3 Myr and the age of 
Crust-2 has an uncertainty of 0.5 Myr. 


gas in local ISM clouds, which suggests a common history or driver’. 


Such clouds have been suggested as part of an expanding superbubble 
shell driven by supernovae and winds from massive stars!*"'*°. 

Assuming the ejecta originate from a distance of 70-100 pc (the 
approximate limit of the Local Bubble) and that °°Fe is equally distrib- 
uted into the outer shell of size 30 pc (a distance representing 1.5 Myr 
of travel), that is, assuming a spherical shell of mean radius 70-100 pc 
with a thickness of 30 pc, we deduce a total ®’Fe mass trapped in ISM 
dust of (5-11) x 10~° solar masses (Mo) in the shell volume. This 
number represents a lower limit because it reflects the fraction of “Fe 
condensed into dust without correction for radioactive decay and 
neglects the granularity of clumpy ejecta. Models predict core-col- 
lapse and electron-capture supernova nucleosynthesis yields for °°Fe 
to be (0.5-14) x 10-°Mo for stars of mass 8Mo-25Mo, depending 
on the progenitor mass, and with large uncertainties in the nuclear 
physics input”*?”. (Super)asymptotic-giant-branch stars produce 
(0.003-1) x 107° Mg *°Fe”®. Our observed signals therefore favour 
supernova events. The fraction of Fe in dust can be roughly estimated 
by a comparison of our measured °Fe deposition with nucleosynthesis 
yields. Under these assumptions and assuming reasonable distances 
(20-100 pc) about 0.4% to 9% of ®°Fe would have been trapped in 
ISM dust particles (Supplementary Information, Supplementary Figs 
7 and 8). 

Comparing our data with a similar work for ISM *“*Pu in sediments 
and crust samples® yields a “#Pu/®Fe atomic ratio of <2-3 x 107° (20) 
during periods of elevated ®’Fe deposition over the past ten million 
years, which agrees with the recently reported low *“*Pu supernova 
yields® (Supplementary Information). 

This broad and global ®’Fe influx on Earth demonstrates recent (within 
the past ten million years) and widespread massive-star ejections in our 
near Galactic neighbourhood (less than 100 pc from Earth), most probably 
from supernova explosions. Interestingly, the older event coincides with a 
strong increase in *He and temperature change at about 8 Myr ago””, while 
the more recent activity starting about 3 Myr ago occurred at the same 
time as Earth’ temperature started to decrease during the Plio—Pleistocene 
transition. 
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The locations of recent supernovae near the Sun 
from modelling °°Fe transport 


D. Breitschwerdt!, J. Feige!, M. M. Schulreich!, M. A. de. Avillez)?, C. Dettbarn’ & B. Fuchs? 


The signature of Fe in deep-sea crusts indicates that one or 
more supernovae exploded in the solar neighbourhood about 
2.2 million years ago’. Recent isotopic analysis is consistent with 
a core-collapse or electron-capture supernova that occurred 60 to 
130 parsecs from the Sun®. Moreover, peculiarities in the cosmic 
ray spectrum point to a nearby supernova about two million years 
ago®. The Local Bubble of hot, diffuse plasma, in which the Solar 
System is embedded, originated from 14 to 20 supernovae within a 
moving group, whose surviving members are now in the Scorpius- 
Centaurus stellar association””®. Here we report calculations of the 
most probable trajectories and masses of the supernova progenitors, 
and hence their explosion times and sites. The Fe signal arises 
from two supernovae at distances between 90 and 100 parsecs. 
The closest occurred 2.3 million years ago at present-day galactic 
coordinates |= 327°, b= 11°, and the second-closest exploded about 
1.5 million years ago at 1= 343°, b= 25°, with masses of 9.2 and 8.8 
times the solar mass, respectively. The remaining supernovae, which 
formed the Local Bubble, contribute to a smaller extent because 
they happened at larger distances and longer ago (°Fe has a half- 
life of 2.6 million years®'°). There are uncertainties relating to the 
nucleosynthesis yields and the loss of Fe during transport, but 
they do not influence the relative distribution of “Fe in the crust 
layers, and therefore our model reproduces the measured relative 
abundances very well. 

It has been shown that supernovae from a moving stellar group, 
whose surviving members are now in the Sco—Cen association, were 
able to generate the Local Bubble”!’. This hypothesis was further 
tested by performing three-dimensional high-resolution numerical 
simulations to follow the evolution of the Local Bubble from its 
formation until today and even into the future’’. It was found that 
the bubble’s extension, as well as its ion column density ratios!>14, 
matched ultraviolet observations (for example, FUSE’*) remarkably 
well. To link the Local Bubble to the amount of ®°Fe discovered in a 
deep-sea ferromanganese crust’, it was necessary to determine the 
distances to the supernovae. Therefore the masses of the supernova 
progenitors were calculated by exploiting the empirical fact that 
stars in clusters are likely to have a universal distribution in numbers 
over mass (initial mass function, IMF). We employed a variable- 
size binning to decrease the statistical bias in small samples by 
placing one star into each bin!®, implying that 16 supernovae have 
exploded during the past 13 million years within the boundaries 
of the Local Bubble. Having derived the masses of the supernova 
progenitors, and assuming that all stars in the moving group were 
born at the same time, we were able to extract the age of the cluster” 
from a comparison of the stars’ positions in a Hertzsprung—Russell 
diagram with isochrones from stellar evolution calculations'”. Since 
the main-sequence lifetime ofa star is determined by its stellar mass 
for similar metallicities, as in a cluster, the explosion times are given 
by the difference of the cluster age and the respective lifespan of a 
supernova progenitor. 


To pin down the explosion sites, we had to calculate the most probable 
trajectories of the group member stars, given the errors in the Hipparcos 
data!® and in the radial velocities. The uncertainties of stellar positions 
increase with distances, that is, when traced backwards in time from 
their current location in the Sco—Cen association. Since these uncer- 
tainties are measurement errors that are statistically independent, we 
can assume them to be Gaussian. This results in probability clouds (one 
for each explosion time) from which we extract the locations of highest 
probability and identify them as the explosion sites of the perished stars 
(see Fig. 1 for the two most recent supernovae). Thus we determined 
the present-day Galactic coordinates where the explosions occurred, 
with the two closest near 1 = 327°, b=11° and 1= 343°, b=25° at 
distances of 90-100 pc (see Extended Data Table 1). 

The transport of °°Fe is strongly affected by the densities, velocities 
and pressures, both thermal and turbulent, of the ambient interstellar 
medium in general, and the Local Bubble in particular. Observationally, 
it was discovered from absorption in X-ray maps that the Local Bubble 
is interacting with the neighbouring Loop I bubble’, which restricts 
its expansion by counter-pressure. Hence it was necessary to produce 
a detailed account of the stellar content and the number of supernovae, 
as well as their explosion times and sites in Loop I. An analysis similar 
to that for the Local Bubble’ was carried out, but for a much larger 
volume of 800 pc in diameter. We found that two star clusters, Tr 10 
and the association Vel OB2, have recently passed through the present 
volume of Loop I. Given a sample of 80 stars that might have entered 
this volume about 12.3 million years (Myr) ago, we estimated from an 
appropriate IMF” that, since then, 19 of the most massive stars should 
already have exploded. We derived the initial masses of those stars (and 
hence their explosion times and total ejecta mass) by assigning one 
star to each mass interval of the IMF as the statistically most probable 
distribution!®. By the same token, the supernova explosion sites result 
from the trajectories of all the progenitor stars, which lie within the 
present Loop I. 

To explain the deposition of Fe on the ocean floor, we calculated 
both analytically and numerically the transport of °°Fe in supernova 
blast waves from their sites of explosion to Earth. The Local Bubble shell 
evolution can be described analytically by a wind solution”!”’, with the 
density within the bubble increasing sharply proportional to r”” or even 
more steeply (where r is the distance to the explosion site). We followed 
the expansion of shells in the interior of the Local Bubble, applying 
the Kahn approximation”? (Fig. 2). Using a standard Sedov-Taylor™*”° 
solution for multiple explosions would give incorrect results, because 
the assumptions of negligible ambient pressure and constant density are 
no longer fulfilled. To estimate the abundance of 60Fe in the crust, stellar 
yields of (2-6) x 10-°Mg (where Mog is the solar mass) from nucleosyn- 
thesis models were used (see Methods). This range includes the yield 
proposed for electron-capture supernovae” of 3.6 x 10~°Mo, which, 
together with their low progenitor masses, makes them prime candi- 
dates for the bulk of the ®’Fe deposition. The most likely mechanism of 


Fe penetrating the heliosphere is its condensation into dust grains””. 
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LCC t = -2.3 Myr 


UCL t =-1.5 Myr 


Figure 1 | Probability clouds of the two most 
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The amount of dust, which forms in supernova remnants, is subject to 
potential losses during its transport to Earth. We have therefore adopted 
the value of U = 0.6% (ref. 2) for the ©Fe survival fraction, which is 
similar to the value used by other authors’. 

To determine the details of the contribution and the timescales of the 
Fe deposition in the crust that is due to each individual supernova 
explosion, we performed three-dimensional high-resolution numerical 
hydrodynamics simulations with adaptive mesh refinement of the joint 


evolution of the Local Bubble and Loop I. We allow the mass, momen- 
tum, energy and “Fe of each supernova to be directly injected into the 
numerical grid. As a background interstellar medium we have different 
homogeneous (down to sub-parsec resolution) and inhomogeneous 
turbulent supernova-driven interstellar environments, into which both 
superbubbles expand. 

The dynamics of “Fe is followed via so-called passive scalars or trac- 
ers, which are quantities that behave like a drop of ink when dispersed 
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Figure 2 | Analytical model of ©Fe-carrying 
shells at 1.4 Myr and 2.2 Myr before present. 

The outer black shell results from the Local Bubble 
evolution?!”, The inner shells correspond to 
single remnants of stellar explosions (companion 
stars are now in UCL (blue) and LCC (red)), taking 
about 0.1 Myr to reach Earth. The blue squares 
and red triangles show the explosion sites of the 
UCL and LCC stars, respectively; the black cross 
denotes the position of the Sun. The layout of the 
panels corresponds to Fig. 1. 
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ina liquid. Figure 3 displays °°Fe mass density distributions associated 
with the two superbubbles for two possible background media. By 
tagging the °°Fe-enriched gas, we measured the flux of ©°Fe atoms 
at Earth’s orbit over the whole simulation time, and then smeared 
it out into bins corresponding to the time resolution of the crust 
measurements. 

Results for the homogeneous and inhomogeneous background 
media are presented in Fig. 4. Although in both cases the timing and 
intensity of the observed ®’Fe excess are well reproduced, the underly- 
ing physical processes are different. In the inhomogeneous case (solid 
purple line, Fig. 4b), the maximum signal is due to two individual 
supernova events (numbers 14 and 15 out of a total of 16; see Fig. 4a), 
which cross Earth’s orbit twice as a result of shock reflection from the 
Local Bubble’s outer shell. In the model with the homogeneous back- 
ground medium (dashed orange line, Fig. 4b), it is the supershell of 
the Local Bubble itself that delivers the live ®’Fe content of all previous 
supernovae (numbers | to 15) at once (Fig. 4a), thereby stretching the 
deposition over several hundred thousand years. 

Of all the simulations the model that fits the data best consists of an 
average ambient density of 0.3cm~%. The explosion sequence leads to the 
formation of the Local Bubble shell, transporting the highest °’Fe con- 
tent to Earth. The most recent explosion produces a much narrower but 
higher peak, resulting in a lower ®Fe abundance at earlier times (Fig. 4). 
This is confirmed by our analytical studies with the same density, which 
show the distribution of ©’Fe-carrying shells for two specific times (Figs. 2 
and 4b). Here, too, the Local Bubble shell reaches the Solar System at the 
time of the measured Fe maximum, with a single supernova remnant 
shell following at later times. We find that the last two (three) superno- 
vae contribute 47% (62%) to the amount of °’Fe in the crust. Further 
evidence that the three latest explosions, between 1.5 Myr ago and 
2.6 Myr ago (Extended Data Table 1), are electron-capture supernovae 
(with 3 x 10°’ erg each) is given by recent analyses of the data obtained 
with the Payload for Antimatter Matter Exploration and Light-nuclei 
Astrophysics (PAMELA; http://pamela.roma2.infn.it) satellite experi- 
ment. These show that the excess of cosmic-ray protons, antiprotons and 
positrons in the energy spectra points towards a total energy injection of 
10° erg about 2 Myr ago. These cosmic rays may also have produced an 
excess of long-lived radionuclides in meteoritic particles. 

The absolute, but not relative, values of the Fe distribution in the 
crust layers depend mainly on two factors, which are beyond our calcu- 
lations, and serve as input parameters: (1) the ®’Fe yields of supernovae, 
which are mass dependent and taken from stellar evolution calcula- 
tions, and (2) the transport factors, which describe the deposition of 
Re in the crust. Once better values become available, our simulations 
can be further constrained and refined. The calculations reported here 
show that it is possible to form a coherent picture of the evolution of 
the Local Bubble and the Fe deposition in the crust that is consistent 
with observations. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Analytic modelling. In the following we focus on the supernovae closest to Earth. 
Of course, on a considerably larger scale than 100 pc, more recent (<300,000 yr ago) 
explosions have occurred, such as Vela and Geminga, which may have left signa- 
tures in 4C, cosmic rays and Earth’s biosphere***), but not in Fe owing to the 
large distances from Earth. 

In case of the nearby stellar moving group, we closely examined each star’s 
velocity, including its uncertainty, and calculated the most probable path for the 
group inside the Local Bubble. Since the progenitors could not be observed directly, 
we assume them to have followed this trajectory together with the other group 
members. The putative explosion sites were calculated for each star (see Fig. 1 
and Extended Data Table 1). Coordinates and velocities from still-existing stars of 
subgroups UCL and LCC were inserted into the epicyclic equations of motion”? 
to trace their trajectories back in time’. The cumulative distribution function can 
be written as 


F= FolanF vert (1) 


where we assume the planar and vertical stellar motions to be statistically inde- 
pendent and therefore decoupled. Hence, the components of the distribution 
function are described by 
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where Xobs; Zobs» Uobs» Vobs: ANd Wops are the observed quantities of the positional and 
velocity components’. Here x, y and z denote the direction towards the Galactic 
Centre, the Galactic rotation and the North Galactic Pole, respectively. The asso- 
ciated o; are the corresponding Gaussian uncertainties. The positional error of 
the y-component, 0), is small compared to the error of the x-component, that is, 
Yo = Yobs, and is therefore neglected. The conditional probability p is given by a 
6 distribution 


P(x, y|0> Yor Uo» Vo) = 4(x — xr) 6(y — y) 


p(zlwo,20) = 52-21) (4) 


The index ¢ denotes the components of stellar motion at time t calculated by the 
epicyclic equations*?”*. 

The observed uncertainties increase with distance, that is, when traced back- 
wards in time. This results in probability clouds for each explosion time, from 
which a location of highest probability is extracted and assumed to be the explo- 
sion site of the perished star. Figure 1 depicts such clouds for the two most recent 
supernovae. 

The radius Ry» of the outer Local Bubble shell at time ¢ after the explosion of 
the first supernova was calculated by”! 


1/5 
Ryp= 1 Ms B/ parsecs (5) 
no 
Here, N, is the number of supernovae, Es) is the explosion energy in units of 
10°! erg, no is the outer interstellar gas number density in units of cm~°, into which 
the shell is expanding, and tf, is the evolution time in units of 10’ yr. For densities 
higher than 0.3 cm~°, the shell is not able to reach the Solar System because, in the 
analytical model, the first supernova had to be assumed to explode in the Local 
Bubble centre. The same also happens in the numerical simulation (although no 
restriction about the first explosion site had to be made) of the Local Bubble expan- 
sion into a homogeneous medium; hence this interstellar gas density is chosen as an 
upper limit. The free expansion of an individual shell, calculated in the analytical 
model, occurred into a low-density Local Bubble medium”! of 5 x 10-3 cm~?. 
This phase lasts approximately 6,000 yr and covers a distance of about 25 pc, with 
variations depending on the ejected mass. The subsequent energy-driven expan- 
sion is calculated by an approximation due to Kahn”’, taking into account the 
density gradient and the counter-pressure of the ambient bubble medium 
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where the density profile of the Local Bubble medium is p= r" with n= 9/2 (for 
a single supernova shell) and Egy is the supernova explosion energy. The time- 
dependent expression for {2 includes the density no of the ambient medium and 
the beginning of the energy-driven supernova remnant evolution. When written 
in the dimensionless units introduced above, equation (6) reads as 


Fe, 2/19 
Rxahn = ed i parsecs (7) 


no 


To obtain the correct hierarchy of shells, it is required that the radius Rsnr of each 
supernova remnant obeys at all times and positions 


Rsyr= min(Rip, Rxahn) (8) 


indicating that equation (7) only holds until an interior supernova shell hits the 
outer Local Bubble shell. 

The amount of ejected °°Fe was estimated using several nucleosynthesis 
models***”, Depending on the input (such as cross-sections, mass loss and rotation) 
into these models, the “Fe yield is scattered over a wide range between 10 Mo 
and 10° 4Mo. The mass-dependent yields were plotted and fitted by an exponential 
function. Thus, for each initial mass of an exploded star a yield of the order of 10~° 
was obtained (see Extended Data Table 1 and Extended Data Fig. 1). 

The Fe fluence, which is the number of “Fe atoms per cm’ that is incorporated 
into the ferromanganese crust, is calculated by*® 


Fa U Mej e-? In(2)/t1/2 (9) 
4 4nAmpr* 
where r is the distance of the supernova explosion to the Solar System, M,,; is the 
ejected Fe mass, A its atomic mass number, mp the proton mass, and U the uptake 
factor. U is divided by a factor of 4 to account for the ratio of Earth’s cross-section 
to its surface area. The formula includes the exponential decay of “Fe atoms with 
time ¢, where ty/2 is its half-life. The fluence is converted into °Fe/Fe ratios by 
normalizing it to the amount of stable Fe in the crust of 15.27 wt% (ref. 39). 

Finally, only a fraction of Fe, the so-called uptake factor U, is eventually 

incorporated into the crust. An earlier proposed value of U was about 0.6% 
(ref. 2), but more recent analyses point to a much higher value“. In our models 
we have adopted the original value, which takes into account not only the uptake 
into the crust, but further losses during transport and the yield uncertainties. It 
may therefore be interpreted as the 0Fe survival fraction, and is close to the value 
of 0.5% (ref. 5). 
Setup of the numerical model. The numerical simulations have been performed 
with a modified version of the massively parallel multi-purpose astrophysical fluid 
code RAMSES"". It uses a second-order unsplit Godunov scheme for solving the 
hydrodynamical equations on an adaptive octree grid, and a Particle-Mesh solver 
for computing the trajectories of collisionless particles, which represent individual 
massive stars in our setup (see below). The computational domain is cubic with a 
side length of 3 kpc. In the model with the homogeneous background medium, all 
boundaries are treated as periodic, whereas in the model with the inhomogeneous 
background medium periodic and outflow boundaries are applied at the verti- 
cal and horizontal sides, respectively. The inhomogeneous background medium 
is self-consistently generated by exposing an initial interstellar gas distribution, 
derived from observations”, for 180 Myr (a timescale comparable to that required 
to reach a dynamical equilibrium”) to the combined effects of the Galactic gravita- 
tional field‘, heating by the interstellar radiation field and cooling by collisional 
ionization equilibrium processes (all modelled using the spectral synthesis code 
CLOUDY“), as well as winds of massive stars, and supernovae. 

Our simulations currently neglect magnetic fields, because their dynamical 
effects are, when compared to turbulence, presumably small on the scale of the 
dynamical evolution of the isolated patch of interstellar matter*’, and in particular 
the Local Bubble. In the case of the Local Bubble, the magnetic field should be 
concentrated in the supershell, where it has been swept, together with the matter, by 
successive supernova explosions. The Local Bubble interior should thus be almost 
field-free. Stars in the mass range considered of 0.5 < M/Mg < 150 are assumed to 
form at a Galactic rate” when the gas in a computational cell exceeds (falls below) 
a certain density (temperature) threshold, with the initial stellar mass spectrum 
obeying an IMF“*. However, only candidates for Type II supernovae (10 < M/Mo < 30) 
are converted into actual star particles, which is certainly justified for the young 
moving groups, which generated the Local Bubble and Loop I. Both their 
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(main-sequence) lifetimes and their mass loss (due to stellar wind and the super- 
nova) are taken to be functions of their initial masses*”. The stellar wind velocity 
is assumed to be constant and is set equal to the canonical value of 2,000kms~’. 
Freshly spawned stellar particles receive in addition a random drift velocity”? of 
5kms ', to account for the observed velocity dispersion. Owing to the high com- 
putational demands of such a complex simulation, the finest numerical grid is here 
somewhat coarser (2.9 pc) than in the model with the homogeneous background 
medium. As was tested by means of calculating filling factors and probability dis- 
tribution functions, the inhomogeneous medium, however, serves as a much more 
realistic environment for studying the Local Bubble evolution scenario. 

Using a model for non-self-gravitating compressible turbulence*, we analysed 
the properties of turbulence in our simulated patch of the Galactic disk. We also! 
found that turbulence is injected at the scales of breaking open superbubbles, and, 
furthermore, that the structure functions up to higher orders are nicely matched 
by a scaling law specifically derived for supersonic turbulence, in which shocks 
represent the most dissipative structures”. 

As noted before, °’Fe is treated as a passive scalar, which is justified by its low 
concentration, precluding any relevant back-reactions on the flow (for example, 
buoyancy). The spatiotemporal evolution of the “Fe distribution can therefore be 
approximately described by an advection-diffusion equation of the form*? 


(10) 


where Z is the ©’Fe mass fraction, u is the fluid velocity, and a is the diffusivity 
of the contaminant (which is assumed here to be isotropic). Besides taking into 
account the radioactive decay of Fe (using its latest derived half-life”!), we have 
actively modelled only the left-hand side of this equation, which is motivated by 
the fact that, particularly in the interstellar medium, diffusive effects are usually 
restricted to the very small scale (the so-called microscale) of turbulence. However, 
our simulations still feature non-zero diffusion of Fe arising from the numerical 
scheme itself. Since such numerical diffusion has been demonstrated to operate in 
general faster and on larger scales than its physical counterpart™, the timescale of 
mixing in our simulations represents a lower limit to the mixing timescale resulting 
from physical diffusion. 

Code availability. The RAMSES code is available for free download online (https:// 
bitbucket.org/rteyssie/ramses). The newly written or modified routines of the code, 
as well as the analytical model, which was programmed in MATHEMATICA, that 
are required for producing the specific setup and results discussed here, are still 
being improved and extended, and will therefore be released in the future. 
Modelling 7A] and *Mn signals in the ferromanganese crust. “Fe is the most 
suitable long-lived radionuclide for detecting recent nearby supernova explosions 
because (1) it is not produced by terrestrial sources, (2) its extraterrestrial influx via 
interstellar dust particles and micrometeorites is negligible, and (3) all primordial 
Fe has decayed since the formation of the Solar System. 

Here we present the distributions of two other candidate isotopes, 
?6 Al (ty; =0.7 Myr) and Mn (t1/2=3.7 Myr), in the ferromanganese crust. Again, 
we performed analytical calculations using the supernova remnant expansion 
model described by Kahn”, and assume that the outer boundary evolves as a super- 
bubble shell”. Therefore, stellar explosion times and arrival times are identical 
to the “Fe model. In addition, the survival fraction of 0.6% was adopted from this 
model. Here, to be conservative, we used the highest possible yields for *°Al and 
°3Min (see Extended Data Figs 2 and 3)°>"*. 

Furthermore, relative supernova abundances of *Fe/”°Al are suggested* to 
lie between 0.6 and 23. As a second approach, we have used the upper and lower 
limits of these ratios to scale the measured Fe signal in the 1.7-2.6-Myr-old layer 
of the ferromanganese crust and estimate the expected 7°Al abundance, taking into 
account the different half-lives. 

In this case it is important to consider the large atmospheric production of 
?6 A], which arises from spallation of mainly Ar atoms by cosmic rays in Earth’s 
atmosphere. The deposition of *°Al onto the layers is displayed in Extended Data 
Fig. 4. The concentration of the radionuclide is normalized to the stable Al con- 
tent of 0.6 wt% in the crust (K. Knie, private communication, November 2015). 
The terrestrial *°Al background, usually of the order of 6A1/?7Al = 10- at the 
surface*’, overwhelms any possible supernova signal in all scenarios considered 
here. Clearly, a detection of a *°Al supernova peak in the crust is challenging, since 
the accelerator mass spectrometry measurement uncertainties are larger than the 
modelled supernova signature. The *°Al estimate presented here is consistent with the 
26A] value measured in the ferromanganese crust (K. Knie, private communication, 
November 2015) as well as with recent measurements performed in deep-sea 
sediments, where no supernova signal was found®. 

The situation is similar for **Mn. Extraterrestrial interstellar dust particles and 
micrometeoritic influx may distribute *Mn on Earth and hide a potential supernova 


signal (Extended Data Fig. 5). The *Mn concentration was normalized to the stable 
°°Mn content of 22.67 wt% and the results are in agreement with *Mn accelerator 
mass spectrometry measurements performed in the ferromanganese crust®”. Again, 
the measurement error bars are larger than the calculated supernova signal and 
therefore the extraterrestrial background hampers the detection of supernova- 
produced Mn. 
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Extended Data Figure 1 | Fe yields from various nucleosynthesis 
models**~*’. Given that the yields are mass-dependent, an average between 
the highest*” and lowest*® was calculated and extrapolated towards the 
lower mass range. Superscripts in the legend refer to reference numbers. 
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Extended Data Figure 2 | 7°Al yields from various nucleosynthesis 
models**~*”. To be conservative, the highest yields** were fitted and 
extrapolated towards the lower mass range. 
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Extended Data Figure 3 | °*Mn yields from various nucleosynthesis 
models***>*”, Again, to be conservative, the highest yields** were fitted 
and extrapolated towards the lower mass range. 
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Extended Data Figure 4 | *°Al in the ferromanganese crust. The largest 
fraction arises from atmospheric production, hiding a possible supernova 
signal. The arrival times of the supernova shells (analytical model with 
input parameters identical to the °’Fe model) are indicated by a blue square 
(UCL) and a green dot (LB, Local Bubble shell), whereas the blue and 
green shaded histograms represent the amount of *°Al deposited onto the 
crust, normalized to *”Al. The resulting supernova signal (SN) is added 

to the atmospheric background (black dashed line). The measured Fe 
concentration” in the layer showing the largest peak (red (blue) dashed 
lines) was used to scale towards the *°Al content using supernova °Fe/*°Al 
ratios®® of 0.6 (23). Here, the atmospheric background has been added to 
the signal. 
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Extended Data Figure 5 | °*Mn in the ferromanganese crust. The largest 
fraction is produced from extraterrestrial dust and micrometeorite influx, 
hiding a possible supernova signal. Again, the analytical model with input 
parameters identical to the ©’Fe model was used. The notation is the same 
as in Fig. 4. 
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Extended Data Table 1 | Supernova explosions that created the Local Bubble 


ts m(Mo) Meg (10->Mo) x(pc) ype) z(pe) Di(pc) 7(°) b(°) @ 5 sc 
12.62 19.86 6.3 277 75 89 300 15.15 17.23 17517 = -7°09" =Oph 
12.09 18.61 5.5 223 99 71 254 23.94 16.22 179377 -9°21™ Oph 
-11.32 17.34 5.0 251 67 87 274 14.95 18.52 17542™ -6°39" Oph 
-10.02 15.41 4.2 227 57 83 248 14.10 19.53 17077 -6°48" Oph 
-10.03 15.36 4.1 185 TL. 67 211 22.60 18.49 179277 -0°23™ Oph 
8.77 — 13.89 3.6 203 45 79 222 12.50 20.80 17°00" -7°23" Oph 
8.03 13.12 3.4 151 49 57 169 17.98 19.75 17514™ = -3°34™ Oph 
-7.5% 12.65 3.3 181 31 75 198 9.72 22.22 16"49m -8°46™ Oph 
6.3? 11.62 3.0 163 11 73 179 3.86 2410 16°30" -12°03" Oph 
6.13 11.48 2.9 121 19 47 131 8.92 20.99 16952" -10°04" Oph 
5.07 10.76 2.7 145 5 69 161 1.97 25.43 16912" -15°19" Sco 
-4.23 10.21 2.6 97 -15 33 104 8.79 18.58 16°16" -24°35" Sco 
-3.82 10.02 2.6 125 1 51 135 0.46 22.19 16928" -15°40" Oph 
-2.67 9.37 2.4 95 -9 47 106 5.41 26.22 169017 -17°05" Lib 
-2.33 9.21 2.4 75 -49 17 91 -33.16 10.74 15°10" -45°35™ Lup 
“1.52 8.81 2.3 83 -25 za 96 -16.76 25.31 15°32" -24°44™ Lib 


Stellar explosion times tsy (before present) for the initial masses m of the supernova progenitors. The subgroups of the Sco—Cen association are identified by superscript numbers (UCL= 2, LCC =3). 
Also listed are the mass-dependent Fe yields, M.j, (see also Extended Data Fig. 1) and the position coordinates (x, y, Z) as defined by the local standard of rest. For each position a distance D is given, 
the corresponding galactic coordinates (longitude / and latitude b), the equatorial coordinates (right ascension a and declination 6 in ICRS coordinates, J2000) and the stellar constellations (sc) in 
which the supernovae explosion sites would be observable at the present night sky. 
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Coherent feedback control of a single qubit in 


diamond 


Masashi Hirose! & Paola Cappellaro! 


Engineering desired operations on qubits subjected to the 
deleterious effects of their environment is a critical task in 
quantum information processing, quantum simulation and 
sensing. The most common approach relies on open-loop quantum 
control techniques, including optimal-control algorithms based 
on analytical! or numerical’ solutions, Lyapunov design? and 
Hamiltonian engineering’. An alternative strategy, inspired by 
the success of classical control, is feedback control>. Because of 
the complications introduced by quantum measurement’, closed- 
loop control is less pervasive in the quantum setting and, with 
exceptions”, its experimental implementations have been mainly 
limited to quantum optics experiments. Here we implement a 
feedback-control algorithm using a solid-state spin qubit system 
associated with the nitrogen vacancy centre in diamond, using 
coherent feedback? to overcome the limitations of measurement- 
based feedback, and show that it can protect the qubit against 
intrinsic dephasing noise for milliseconds. In coherent feedback, 
the quantum system is connected to an auxiliary quantum 
controller (ancilla) that acquires information about the output 
state of the system (by an entangling operation) and performs an 
appropriate feedback action (by a conditional gate). In contrast to 
open-loop dynamical decoupling techniques’, feedback control 
can protect the qubit even against Markovian noise and for an 
arbitrary period of time (limited only by the coherence time of 
the ancilla), while allowing gate operations. It is thus more closely 
related to quantum error-correction schemes'!!~"“, although these 
require larger and increasing qubit overheads. Increasing the 
number of fresh ancillas enables protection beyond their coherence 
time. We further evaluate the robustness of the feedback protocol, 
which could be applied to quantum computation and sensing, by 
exploring a trade-off between information gain and decoherence 
protection, as measurement of the ancilla—qubit correlation after 
the feedback algorithm voids the protection, even if the rest of the 
dynamics is unchanged. 

To demonstrate coherent feedback with spin qubits, we choose two 
of the most common tasks for qubits—implementing the no-operation 
(NOOP) and NOT gates, while cancelling the effects of noise. A simple, 
measurement-based feedback scheme, exploiting one ancillary qubit, 
was proposed in ref. 15. The correction protocol (Fig. 1a) works by 
entangling the qubit-ancilla system before the desired gate opera- 
tion. By selecting an entangling operation U. appropriate for the type 
of bath acting on the system, information about the noise action is 
encoded in the ancilla state. After undoing the entangling operation, 
the qubit coherence can be restored by a feedback action, that is, by 
measuring the ancilla and applying a correction operation U, on the 
qubit based on the measurement result. Although this scheme applies 
to a broad class of environments, in the case of dephasing noise, the 
existence of the appropriate unitaries, U.=0, and Uy=0az (where 
0, are Pauli matrices), has been proven’. The principle of deferred 
measurements'® allows postponing measurements until the end of a 
quantum algorithm. Here it enables replacing the ancilla measurement 


and classical feedback by coherent conditional gates. The final qubit 
state is then completely decoupled from the bath effects, which are 
instead imposed on the ancilla. Because the scheme is compatible with 
any gate that commutes with U,, it is more flexible than dynamical 
decoupling techniques. Another advantage is that the scheme is valid 
independently of the characteristic timescale of the bath and thus it can 
be used even in the presence of a Markovian bath for which dynamical 
decoupling fails. These two advantages enable the use of the feedback 
scheme for sensing, for example to detect transverse magnetic fields 
or, by a simple modification of the algorithm, to protect the sens- 
ing qubit from bit-flip errors while detecting fields aligned with the 
z quantization axis!7-!°. 

We demonstrated experimentally this feedback-based protection 
algorithm with a spin system associated with a single nitrogen- 
vacancy (NV) centre in diamond. This system has emerged as a good 
candidate for quantum information processing, because its electronic 
spin-1 can be optically polarized and measured, and presents long 
coherence times even at room temperature. Here we use an auxiliary 
qubit associated with the ‘“N nuclear spin-1 of the nitrogen vacancy, 
to protect the coherence of the NV electronic spin against dephasing 
noise. The Hamiltonian of the electronic-nuclear spin system is 


H = AS? + wS,+ QI; + wolz + ASA, + B (Sele + Sly) 


where S and J are the electron and nuclear spin operators, respectively, 
A= 2.87 GHz is the electronic zero-field splitting and Q= —4.95 MHz 
is the nuclear quadrupolar interaction. The two spins are coupled by 
an isotropic hyperfine interaction with a longitudinal component 
A=-—2.16 MHz and a transverse component”” B = —2.62 MHz that 
can be neglected to first order. A magnetic field is applied along the 
NV crystal axis [111] to lift the degeneracy of the ms = +1 level, yield- 
ing the electron and nuclear Zeeman frequencies w, and wy, respec- 
tively. In the experiment, we use a subspace of the total system 
representing a two-qubit system. For convenience, we choose the space 
spanned by the four levels {|7s, mz) =|0, 1), |0, 0), |—1, 1), |—1, 0)}. 
The effective Hamiltonian can then be rewritten by using spin-1/2 
operators and simplified by transforming to a rotating frame at the 
resonant frequency of the qubit (A — w,) and the ancilla (w_+ Q): 
H' =A(-—o$+02-o%07) /4. 

In our high-purity diamond sample, decoherence processes of the 
electronic spin can be mainly attributed to a bath of spin-1/2 °C nuclei 
in the lattice (1.1% natural abundance), yielding a short dephasing 
time T*, ~ 4 ts. We neglect the interaction between the bath and the 
ancillary spin, which couples very weakly to any source of magnetic 
field noise; indeed, the much longer coherence time of the My] spin, 
T%,~3 ms, is limited by the NV electronic spin relaxation and not by 
the °C bath. 

In the experiment, the electronic spin qubit is initialized into the 
ms=0 state by optical excitation at 532 nm. At the chosen magnetic 
field strengths (390-510 G), optical pumping also polarizes the nuclear 
spin into the m;=-++1 state, owing to resonant polarization exchange 
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Figure 1 | Feedback algorithm and experimental implementation. 

a, b, Nitrogen-vacancy (NV) centre in diamond (a) and relevant energy 
levels of the spin system (b), showing polarization processes under optical 
illumination*’. c, Quantum circuit. Hadamard gates (H) prepare and read 
out a superposition state of the qubit, |b), = (|0) + |1))//2. Amid 
entangling gates between qubit (‘q’) and ancilla (‘a’), the qubit is subjected 
to noise (lightning bolt) and possibly unitary gates U. We assume the 
ancilla is not affected by the bath (‘b), yielding 1, ® Uqp(7), with Ugp(7) 
the qubit-bath joint evolution. Given a dephasing bath, we set the 
entangling gate to U.=o;, (conditional-NOT gate). More generally, 

upon undoing the entangling operation, the system is left in the state 
|%(r)) = ([0).K*] dy 4) + |1)aK-|by &o))/J2, with K*= Ugy UUgpUt- 
The entangling gate U, is designed such that Kt = 1, ® xt and 

K-= Ul ® Xp in which x, act on the bath only, and U, acts on the qubit 
only. After measuring the ancilla, we could use a feedback operation U, to 
restore the correct qubit state. The ancilla measurement is replaced by 
coherent feedback (shaded region) obtained by a controlled-correction 
gate (here U, =o; (Z) for dephasing noise). The final state of the combined 
system is then (|0)ax;, + [1)aX,)I@)ql€)v //2, which reveals how the qubit 
is now decoupled from the bath. d, Experimental implementation. The 
laser excitation polarizes both spins. Black sinusoidal lines refer to 
selective microwave (MW) pulses acting only in the m;= 1 manifold (thus 
mimicking controlled-NOT gates); solid bars indicate non-selective pulses. 
The radio-frequency (RF) excitation describes selective pulses in the 
m;=0 manifold. We use 1/2 rotations about x to approximate Hadamard 
gates. To implement a non-selective radio-frequency 1/2 gate on the 
nuclear spin we embed a non-selective microwave 7 pulse into two 
consecutive radio-frequency 1/2 pulses. The controlled-correction gate is 
implemented by free evolution (¢,) under hyperfine coupling. 


with the electronic spin in the excited state’!. After initializing the 
qubit in the superposition state|d), = (|0). + |1).)/-/2, we implement 
the feedback-based protection algorithm using the control operations 
shown in Fig. 1b. Gate operations on the qubit and ancillary spins are 
performed by external microwave and radio-frequency fields”’, 
respectively (see Methods). Because the noise is dephasing in nature, 
the feedback-based protection algorithm requires the entangling gate 
to bea controlled-NOT gate, U-=0,, while the coherent feedback gate 
is implemented by a controlled Pauli-Z gate, Uy =a; (see Fig. la and 
Methods). Finally, the qubit state is optically read out by monitoring 
spin-state dependent fluorescence. 

We measure the fidelity of the NOOP and NOT gates while var- 
ying the protection time 7. As shown in Fig. 2a, the feedback algo- 
rithm protects the qubit coherence against the noise created by the 8C 
nuclear spin bath for times 7 > 1 ms, much longer than the dephasing 
time. The spin bath in this experimental system is non-Markovian 
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and so dynamical decoupling techniques would reach similar protec- 
tion times, as previously demonstrated”’; however, unlike dynamical 
decoupling, this method would also be effective in the Markovian 
noise regime, with its ultimate limit set only by the coherence time of 
the ancillary spin and the qubit relaxation. This limit could be over- 
come by re-initializing the ancilla or, if additional ancillas (such as 
nearby '%C spins) are available, by repeating the algorithm with fresh 
ancillas, or by concatenating layers of protections against multi-axis 
noise (see Methods). Moreover, our scheme is more flexible than other 
protection schemes, including coherent transfer to the ancilla qubit”4, 
as it allows the application of some gate operations on the qubit!. 
For example, in our implementation, the qubit was evolving under 
the action of the hyperfine coupling, as indicated by the coherent 
oscillations of the fidelity (these oscillations could be removed by an 
adequate timing choice). In addition, we can implement a gate on the 
qubit, for example a NOT gate, during the feedback-based protection 
algorithm (Fig. 2d), without degrading the protection, even at times 
much longer than the gate itself and independent of the time at which 
the gate is performed (see Methods). 

The success of the feedback-based protection algorithm relies 
on the fact that the increased qubit entropy, due to the coupling to 
the environment, is dumped on the ancilla. This entropy transfer 
could be revealed by measuring the state of the ancilla, which would 
yield information about the noise, while preserving the qubit state. 
If, instead, information about the ancilla is collected by a correlated 
qubit-ancilla measurement, the protection fails**, as would happen 
in a measurement-based feedback for an imperfect ancilla read-out. 
Here we investigate this trade-off between the protection power and 
information gain on the ancilla in two different experiments. First, 
we transfer a part of the ancilla entropy back to the qubit by using 
a conditional gate that maps the state of the ancilla onto the qubit, 
correlating the two qubit states. By changing the angle of rotation of 
the conditional gate we vary the strength of the ancilla measurement 
from a weak to a strong one. As more information about the ancilla is 
acquired, the fidelity of the protection gate decreases (Fig. 2b). Note 
that we can combine the conditional gate performing the weak meas- 
urement of the ancilla state with the last conditional Z-gate of the 
algorithm, thus in practice performing a conditional phase-shift gate. 
The qubit fidelity is maximized when the conditional gate performs 
the required 7x rotation on the qubit, whereas it decreases when a dif- 
ferent phase rotation is used. 

Although this experiment clearly shows that the effectiveness of the 
protection is lost when a measurement of the ancilla—qubit correlation 
is performed, we can show that similar results also arise naturally from 
non-ideal experimental measurement conditions. We thus repeated 
the experiment at a different magnetic field, for which our signal—the 
fluorescence intensity—carries information about the state of both the 
qubit and the ancilla, and their correlation. At low magnetic field, we 
can approximate the normalized measurement operator by 
Mo= (0) (0| @ I; however, for magnetic fields close to the excited-state 
level anti-crossing (B~500G such that the Zeeman energy w is close 
to the excited-state zero-field splitting A, ~ 1.4GHz) the normalized 
observable is My — (€|10)(10| + 700) (00]), where € and 7 account for 
the fact that each electronic-nuclear spin level considered has a differ- 
ent fluorescence emission intensity due to the excited state 
dynamics”®. 

Because the measurement observable at this magnetic field contains 
partial information about correlations between the qubit and ancilla 
states, the qubit fidelity at long times is reduced (Fig. 2a), reflecting 
the effects of the noise acting on the qubit. We thus reveal a trade-off 
between the amount of information acquired about the ancillary sys- 
tem and the protection that it provides to the qubit. In addition, we 
single out a measurement of the ancilla alone, without qubit-ancilla 
correlations, by comparing the signal acquired so far with the signal 
after the qubit is rotated by a 7 pulse just before detection. If the meas- 
urements were independent of the state of the ancilla, the average of 
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Figure 2 | Experimental demonstration of the feedback-based 
protection algorithm. a, The signal—normalized photoluminescence 
intensity—oscillates at the hyperfine coupling frequency, A= —2.16 MHz. 
The initial coherent superposition state of the qubit is preserved for a time 
T>1ms at B=390G (red circles), while we observe a sharp decrease in 
the signal amplitude at B= 514 G (black squares), where correlations 
between the qubit and ancilla states are partly measured. b, c, This 
behaviour is evident when we compare the fidelity at short (b) and long (c) 
times, for B= 390G (red circles) and B=514G (black squares). 

To highlight the differences while taking into account different 
photoluminescence intensities at the two fields, we normalized all the 
data so that at short times the signal has the same (maximum) contrast. 

d, Protected NOT gate (red circles). The coherence of the qubit is 


the two experiments should not vary in time (because it corresponds 
to measuring the identity operator); indeed, this is what we observed 
in experiments performed at the lower magnetic field (Fig. 3a, c). By 
contrast, for a magnetic field close to the level anti-crossing, the pres- 
ence of oscillations indicates that we can extract information about 
the ancilla. However, at times longer than the qubit dephasing time, 
the state of the ancilla has completely decohered (Fig. 3b, d), thus 
quenching these oscillations. 

In conclusion, we performed coherent feedback control on a sin- 
gle solid-state qubit associated with the NV centre in diamond. The 
feedback algorithm was applied to protect the qubit coherence, while 
performing two essential qubit gates—NOOP and NOT gates—during 
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protected for a time longer than the dephasing time (7 > T,) even when a 
NOT gate is applied. We compare the dynamics to the NOOP dynamics 
(black squares), demonstrating that the NOT gate inverts the state of the 
qubit, as indicated by the out-of-phase oscillations. e, Weak measurement 
of the ancilla. Normalized photoluminescence signal after a protection 
time T= 81s, as a function of the strength of the ancilla measurement. 

In the experiment, we vary the strength of the ancilla measurement by 
changing the angle of the last controlled phase rotation gate by varying 
its time. All error bars represent the standard deviation of the signal, 
calculated by error propagation from the photoluminescence intensity 

of the signal and of reference photoluminescence curves acquired for 
each data point for ms= {0, —1}. The lines are fits to the model presented 
in Methods. 


the protection time. We showed that this feedback-based protection 
algorithm protects the qubit coherence far beyond the dephasing time 
of the qubit, even if no active control is applied to decouple it from 
the noise. The algorithm can be extended to applications in quantum 
information processing and quantum sensing, and it could be imple- 
mented in many other hybrid spin systems, such as phosphorus” or 
antimony”* donors in silicon, defects in silicon carbide”, or quantum 
dots*°. Because we applied a coherent feedback protocol, thus avoiding 
measurement of the state of the ancilla, the decoherent effects of the 
bath are effectively stored in the ancilla. Consequently, we were able 
to explore an interesting compromise between the amount of infor- 
mation gained about the ancilla and the effectiveness of the protection 
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Figure 3 | Partial measurement of the ancilla. a~d, Comparison of the 
fidelity signal with (grey) and without (black) a 7 pulse on the qubit, 
revealing the amount of information acquired about the ancilla qubit state. 
Signals were measured at short protection times, T< T>. (0O—2 ps; a, b), 
and at longer times, 7 > Ts. (20—22 1s; ¢, d). Error bars represent the 
standard deviation of the signal (see Fig. 2). The data are fitted (lines) 
using the model described in Methods. At the lower magnetic field 
(B=390G; a, c), the average signals (red lines, obtained from the fits to 
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the data) exhibit only weak oscillations, indicating that at this field the 
measurement carries very little information about the ancilla state. For 
magnetic fields close to the level anti-crossing (B= 514G; b, d), the 
oscillation of the average signal is more pronounced and is observed until 
T < T%,, whereas it disappears at longer times. This signal behavior is an 
indication that the ancillary spin effectively decoheres on the T%, timescale 
as a result of the feedback algorithm. 
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algorithm. These results contribute to elucidating the robustness of 
feedback control and to paving the way for a more widespread adop- 
tion of close-loop control strategies for solid-state qubit systems. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Measurement-based versus coherent feedback. In feedback protocols, the goal is 
to control an open system (the plant) so that it undergoes a desired evolution or it 
reaches a desired state. To achieve this goal, feedback protocols engineer a second 
system (the controller) connected to the plant. In quantum feedback there are two 
possible strategies. In the first strategy, we measure the plant output and engineer a 
classical controller that manipulates the plant according to the measurement result 
(this is called a measurement-based feedback protocol). In the second strategy, the 
controller is a second quantum system, coherently coupled to the plant. This strat- 
egy, called coherent feedback’, allows avoiding measurements. The controller can 
gain information about the plant state by coherent operations, sharing information 
about the state via entangling operations. Then, the controller can manipulate the 
plant on the basis of this information to achieve the desired plant dynamics, by 
performing conditional operations on the plant. 

Although the first strategy is more intuitive and closer to the classical scenario, 
coherent feedback is more intriguing, both from a theoretical point of view, owing 
to its relation to information and entanglement theory, and from an experimental 
one, because avoiding measurements often simplifies the experimental implemen- 
tation. In addition, coherent feedback formally encompasses measurement-based 
feedback and, as we did here, a measurement-based protocol can be transformed 
into a coherent one. To clarify the two strategies, we present the feedback protocol 
we considered’ using both strategies. 

In both protocols, the ancillary qubit is first prepared in a superposition state, 
so that the initial state of the combined system is |W) = (|O)a + |1)a)|)qlE)b/V2, 
where |€), and |@), are the initial state of the bath and qubit, respectively. The 
qubit and the ancilla are then entangled by a controlled operation U;, 
|W) = (|O)al)q + |1)a Uel)q)|€)b //2. The choice of U, depends on the type of 
noise to be refocused. As explained in ref. 15, U- can be chosen so that it transforms 
the qubit-bath interaction into a block-diagonal form. This enables performing 
appropriate correction operations in each sub-manifold of the qubit. More intui- 
tively, if the action of the bath can be represented by a random unitary operator 
Ug = e!ro", with @, random and na unit vector, then U- should invert the action 
of the bath: ut UU: = Uty. After the system evolves under the action of the bath 
(and possibly of a desired gate), the entangling operation is undone, 
[¥2) = (|0)Ugoldqlé)s + |1aU!UyoUe\)q\)p)/~/2. Owing to the entangle- 
ment, information about the action of the bath is encoded in the state of the ancilla. 
Thus, in the measurement-based feedback algorithm, the ancilla state is measured 
(in the x basis) and, depending on the outcome, a classical control operation is 
applied to the qubit. We can rewrite the state just before the measurement as 


|W) = ; |+) (Ugo + UlUgpUe) at |) (Ugp a ULUgpUe) |P)ql€)o 


where |-+) are eigenstates of o%. With an appropriate choice of U-, 
Kt= Ugo U-UgoU! reduce to Kt = 1,8 Xe and K- = Uj @Xy where xt are 
operators acting on the bath only and U, is a unitary acting on the qubit. If the 
measurement of the ancilla yields the negative eigenstate, then we apply the cor- 
rection operation Ug, which is a unitary that depends on the type of bath that affects 
the qubit. In the simple noise model we considered above (unital noise with a 
stochastic Hamiltonian generator H,xo-n), the correction operation is Ug =io-n. 

In the experimental implementation, the main source of decoherence is dephas- 
ing noise; therefore, we fixed U.= 0, and U,=0;. Ina magnetometry scenario, 
when we want to correct for bit-flip errors while measuring longitudinal external 
fields, the gates would be U-= a, and Ug=ox. 

In the coherent feedback strategy, the measurement and classical correction 
operation are replaced by a coherent conditional operation (see Extended Data 
Fig. 1a, b). The measurement (that is, conveying information to an external 
observer) is not needed for the ancilla to be able to perform a correction operation, 
because the ancilla already encodes the necessary information about the qubit state 
and the action of the noise. 

At the end of the feedback protocol, the ancilla can be recycled to perform a 
second round of protection, thus allowing the protection to be extended beyond 
the coherence time of the ancilla. In the current experiment, we could achieve the 
first strategy by using coherent polarization exchange between the nitrogen 
vacancy (NV) and |N spins at lower (or much higher) magnetic field, at which 
optical polarization leaves the nuclear spin untouched. After a first run of the 
algorithm (for a protection time t < T},), we could swap the state of the NV 
electronic and nuclear spins, re-polarize the NV centre and swap back the state 
from the nuclear spin to the NV, before applying the feedback protection for 
another stretch of time. Alternatively, if the ancilla cannot be re-initialized, then 
we could use additional fresh ancillas (see Extended Data Fig. 1c), for example, 
other !C nuclear spins in the lattice. If two ancillas are available, then we can 
construct a concatenated feedback algorithm (see Extended Data Fig. 1d) that 
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protects the qubit against multi-axis noise. The qubit is then protected against, for 
example, both dephasing (Tz) and depolarizing (T\) noise. More broadly, the 
feedback-based protection algorithm could provide a first layer of protection 
against the strongest noise source, and be combined with more advanced quantum 
error-correcting codes (even on the ancilla itself) with the goal of achieving fault 
tolerance. 

Modelling of the time evolution of the qubit-ancilla system. We study the 
time evolution of the qubit during the algorithm under the effects of a bath, to 
better understand the action of the feedback-based protection algorithm and 
to obtain insight into its limits. Although in the experiment the spin bath is 
non-Markovian, we can use a simpler model with a Markovian bath to find an 
analytical solution, because it yields the same result up to a different form of the 
exponential decays. 

In the model, we consider dephasing of the qubit and the ancilla, and the lattice 
relaxation of the qubit, characterized by the timescales T¥,, T3, and T., respec- 
tively. Letting p be a state of the qubit and the ancilla, the time evolution under the 
Markovian environment is determined by the Lindblad equation*! 
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Given an initial state = |0)(0|-® |1)(1|n, the state after the NOOP gate for a 
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The effectiveness of the feedback-based protection algorithm can be evaluated by 
measuring the probability of retaining the initial qubit state. The state fidelity is 
then given by the ideal normalized measurement operator Mo = |0) (0|- that gives 
the measured signal S 
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S(r) = Trlp(r) Mol T 
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This result clearly indicates that the fidelity is not limited by the dephasing of the 
qubit T*., but by that of the ancillary spin and the T;, relaxation time of the qubit. 

If instead we measure the normalized observable at the magnetic field corre- 
sponding to the excited-state level anti-crossing (ESLAC), then we gather some 


information about the ancilla, resulting in a degradation of the protection 
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Although the first line still indicates that the state is protected by the presence of 
the ancilla, terms in the other lines, which decay at the rate of Ts originate from 
the measurement of the ancilla spin and can be observed as a reduction of the 
oscillations of signal in the experiment. 

In the main text we showed that it is possible to perform a NOT gate during the 
protection time. Because the gate time is typically short (58 ns in our experiments), 
we studied the subsequent evolution of the qubit, to show that its coherence is 
preserved for times longer than the dephasing time, T>, ~ 4 1s. In addition, it is 
possible to insert the NOT gate at any point during the protected evolution. Owing 
to the coupling with the ancilla qubit, the resulting signal is 


S3(7; 7) =Trlp,(7, T)Mo] 
(1) 


=> + _leos(ar AT) + cos(AT,) exp 


1 1 
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where 7, is the time at which the NOT gate is applied (with 7 the total protection 
time). This expression was used to fit the experimental results obtained in the main 
paper and the data presented in Extended Data Fig. 2. 
Weak measurement of the ancilla-qubit correlation. To observe the com- 
promise between protection and information gained on the ancillary qubit, we 
perform a weak measurement of the nuclear-electronic spin correlation by apply- 
ing a conditional gate after the feedback algorithm and before the detection of 
the NV electronic spin by optical read-out. As shown in Extended Data Fig. 3a, 
the weak measurement could be obtained by applying a conditional rotation of the 
NV electronic spin, after that spin had been rotated back to its population state 
by a 7/2 pulse. The rotation angle ¢ determines the strength of the measurement. 

However, these additional gates can be avoided by noting that the circuit is 
equivalent to replacing the conditional x Z-gate with a conditional @ Z-gate, where 
0= 1/2 —¢ sets the measurement strength; see Extended Data Fig. 3b. 

The state after the feedback algorithm is 
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where a(9) = 1+ cos(Ar/2+ 0)/cos(Ar/2). Assuming no contrast of the meas- 
urement of the nuclear spin («== 0), the signal is 


exp is exp us 
Tin Te 


Although for = x (a =0) we recover the ideal fidelity, protected against the elec- 
tron dephasing noise, when 6+ 1 we perform a partial measurement of the ancilla, 
but the signal now decays according to the time constant T,. 

Experimental set-up. Experiments were performed using an electronic-grade 
single-crystal diamond plate provided by Element 6, with [100] orientation and a 
nominal nitrogen concentration of less than 5 parts per billion. A home-built con- 
focal microscope was used to scan the sample (scanning piezo stage Nano-3D200, 
Mad City Labs) and to measure a single naturally occurring NV centre situated 
about 101m from the surface of the diamond. The NV centre is coherently excited 
by an optical pulse at 532 nm provided by a diode-pump laser (Coherent Compass 
315M) and shaped by an acoustic optical modulator (AOM) with rise time <7 ns 
(1250C-848, Isomet), and then delivered by means of a 100 x oil immersion objec- 
tive lens with numerical aperture NA = 1.3 (Nikon Plan Fluor). The fluorescence by 
the NV centre is collected by the same objective lens and then sent to a single-mode 
broadband fibre of NA = 0.12 (Font Canada), being connected to a single photon 
detector (SPCM-AQRH-13-FC, Perkin Elmer). An arbitrary waveform generator 
(AWG) with 1.2 GSs~! (AWG5014B, Tektronix) plays a central role in generat- 
ing microwave and radio-frequency pulses, as well as timing the AOM and the 
single-photon detector. Microwave pulses are shaped by mixing the signal from the 
AWG with a carrier microwave field provided by a signal generator (N5183A-520, 
Agilent) with the aid of an I/Q mixer (IQ-0318L, Marki Microwave), and then 
cleaned up by a switch (ZASWA-2-50DR-+, Mini-Circuits); radio-frequency pulses 


are directly generated by the AWG. The microwave pulse and radio-frequency 
pulses are combined after amplification (GT-1000A, Gigatronics for microwave 
and LZY-22+, Mini-Circuits for radio-frequency) and delivered to the sample via 
a thin copper wire with a diameter of 251s (Alfa Aesar). 

Dynamics and control of the two-qubit system. Hamiltonian of the NV-™N system. 
The NV-centre spin system consists of the electronic spin (S=1) and the 
nitrogen nuclear spin (J= 1) that interact strongly via the hyperfine coupling 
(Ax —2.16 MHz). A static magnetic field is applied to lift the degeneracy of the 
electronic ms= +1 and nuclear spin m;= +1 levels, thus yielding the Hamiltonian 


H = AS? + wSz+ QI? + wl + ASclz 


where S and J are the electron and nuclear spin operators, respectively, 
A= 2.87 GHz is the electronic zero-field splitting and Q= —4.95 MHz is the 
nuclear quadrupolar interaction. Here We/n =YenB are the Zeeman energies, with 
the gyromagnetic ratios y~ =2.8 MHzG" and 7, = —0.308 kHzG"! for the elec- 
tronic and nuclear spin, respectively. 

We define one qubit in the electronic spin subspace as |ms=0) =|1), and 
|ms=—1) =|0)q. A second, ancillary qubit is defined in the subspace of the nuclear 
spin as |my;=+1) =|1), and |m;=0) =|0),. Both spins are driven on-resonance 
on these selected transitions; therefore, we can neglect other parts of the Hilbert 
space and express the Hamiltonian in the rotating frame in the subspace of interest. 
We then rewrite the Hamiltonian in terms of spin-1/2 operators Oxy, , defined by 
the basis above as 


H=2(-0f +02— of?) 


The hyperfine interaction induces a frequency shift, thus enabling the selectivity 
of the microwave pulse to be adjusted by tuning the transition rate ({2.). A strong 
driving Qe = 20 MHz > A, is used to drive the qubit non-selectively for any ancilla 
state, whereas (2, = 500 kHz < A is used to drive the qubit selectively on the ancilla 
state, thus engineering conditional gates. The radio frequency is on resonance with 
the transition between |0, 1) and |0, 0). The transition rate is about (2, = 20-40 kHz, 
which reflects an enhancement due to electronic virtual transitions mediated by 
the transverse component of the hyperfine coupling”’. Because the nuclear Rabi 
transition rate is always smaller than the detuning given by the hyperfine interac- 
tion (Qa < A), only selective driving of the ancillary spin is available and uncon- 
ditional gates need to be engineered with composite pulses. 

Coherent control of the nuclear spin. The '‘N nuclear spin is coherently controlled 
by using a resonant radio-frequency field. In general, the nuclear spin state cannot 
be directly measured optically with high fidelity. Therefore, after driving the nuclear 
spin, we apply a gate to map the nuclear spin state to the NV electronic spin 
(Extended Data Fig. 4a). To calibrate the quantum gates required for the 
feedback-based protection algorithm (1/2 rotation), we first characterize the nuclear 
spin by measuring the resonance frequency of the transition |1)q|1),+>|1)q|0)a 
(Extended Data Fig. 4b) and the nuclear Rabi oscillations (Extended Data Fig. 4c). 
Despite the small gyromagnetic ratio of the nitrogen (7/, = 0.308 kHz G~’), its Rabi 
oscillations are substantially enhanced, owing to the transverse hyperfine coupling 
with the NV spin (we achieve an enhancement factor of about 20 over the bare Rabi 
frequency around 500G; ref. 20). To further characterize the nuclear spin ancillary 
qubit, we measured the dephasing time T°, by performing a Ramsey experiment 
(Extended Data Fig. 4d) and obtained T*, = 3.2 ms. 

Although the dynamics of the !4N spin under radio-frequency driving is rela- 
tively simple when the NV electronic spin is in one of the population states, further 
complications arise when the NV qubit is in a superposition state. These compli- 
cations stem from the electronic spin decoherence and its evolution due to the 
radio-frequency driving and the hyperfine interaction. 

For a typical nuclear spin gate, such as the 7/2 rotation required for the algo- 
rithm, we need to consider the decoherence process of the electronic spin, which 
progresses at a faster rate (Ts, ~ 4 1s) than the gate operation (t, + 30 1s). In addi- 
tion, we need to engineer unconditional gates, even when the driving field can only 
rotate the nuclear spin in one of the electronic spin manifolds. To protect the qubit 
state during the gate operation, a microwave 7 pulse is inserted in the middle of 
the radio-frequency pulse and is then compensated by adding a second microwave 
—n pulse at the end of the radio-frequency pulse. This procedure simultaneously 
solves the second issue, because it produces a non-selective gate. The Hamiltonian 
under the radio-frequency driving in the rotating frame is 1 =H + Herp with 
Har(t) = — Qe20%cos(Wppt + )/2 + J2 Qno?. Here 2, and 22, are the coupling 
strength of the electronic spin and the nuclear spin with the longitudinal and 
transverse component of the radio-frequency field, respectively. We can neglect 
the coupling of the nuclear spin with the longitudinal component of the radio- 
frequency field and the transverse component of the radio-frequency effects on 
the electronic field, because at most it induces a resonance shift (Bloch-Siegert 
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shift®”) that is refocused by the microwave 7 pulses embedded in the radio-fre- 
quency pulse. The longitudinal driving imposes an additional, quite large, phase 
modulation O(7) on the NV spin, even in the presence of the spin echo 7 pulses 


Qez 


7: 2r 
O(r) =- 5 i sin(Wppt + d)dt — f sin(Wrrt + ¢)dt 


(2) 


2 

= ez cos(WRET + ¢)sin [a= 
WRE 2 

Although, in principle, we can remove this phase by a proper selection of the 

timing, in practice this often clashes with other time requirements set by the need 

to refocus effects due to the hyperfine couplings (explained below). 

In our experiment, we used a more flexible strategy to remove this undesired 
modulation, which used time-proportional phase incrementation*? (TPPI) of the 
radio-frequency phase ¢(T). O(7) in equation (2) can be set to zero at any time 
by selecting (7) = —wppr. Unlike stroboscopic detection with a time interval 
67 =2n/wpp TPPI enables continuous measurement of the !N spin Rabi driving; 
it is also of great use when the time resolution of experiment system is limited and 
67 cannot be selected with enough precision. Defining U(r) = exp(—iHr) and 


Ri, = exp(+Fino;,), the two-spin propagator V(r) under TPPI conditions is 
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where we took the limit A >> (2, to neglect off-resonance driving. Owing to the 
combination of the hyperfine coupling Ao’o7 /4 and the radio-frequency driving 
QB, x0", /2, the propagators of the nuclear spin before and after the microwave T 
pulse do not commute and the nuclear spin evolves about non-parallel axes in the 
two time intervals. This can be directly observed as a modulation of the electronic 
spin echo, similar to the more common Electron Spin Echo Envelope Modulation 
(ESEEM)* for anisotropic hyperfine coupling. To avoid the effects of the hyperfine, 
we set AT =2n. Because A >> (2p, we can choose a timing 7 that fulfils both the 
above condition and the desired radio-frequency pulse time, yielding V(r) = 0°. 
Measurement model and nuclear-spin-dependent fluorescence intensity. Measurement 
of the NV spin state is achieved by monitoring the photoluminescence under laser 
excitation at 532 nm. Owing to spin-dependent photodynamics in the excited state, 
the photoluminescence intensity is correlated with the NV electronic spin popu- 
lation. At magnetic fields close to the ESLAC, the photoluminescence intensity 
becomes modulated by the nuclear spin state as well, owing to the strong hyperfine 
coupling between the electronic and nuclear spins in the excited state”. Therefore, 
the observable in the experiment can be modelled by the operator 


M= m,1|1, 1)(1, 1| + ny,o|1, 0) (1, 0| + No,1|0, 1)(0, 1| + 70,0|0, 0) (0, 0| 
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where nm ,mq ate stochastic variables denoting the total number of photons*® 
detected during the measurement time (300 ns) from the initial state |r, m,). In 
writing the observable, we neglected the ms= +1 and m;=-+1 states, which are 
never populated in the experiment. To reduce the photon shot noise, all measure- 
ments are repeated 5 x 10° times. The raw photoluminescence signal S at each 
measured point of the sequence is normalized by the photoluminescence collected 
from two fiducial states |1, 1) and |0, 1). These states are prepared at each average 
from optical polarization and optical polarization followed by adiabatic passage of 
the electronic spin to the |1, 1) state. The raw signal from a state p, is then normal- 
ized to yield 


where we define the normalized measurement operator M as 


1 0 0 0 
xz _|0 l1-—e 0 0 
M=\y 0 0 0 
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Here €=((11,1) — (m10))/((m1,1) — (No,1)) and 7 = ((10,1) — (10,0))/((111,1) — (10,1) 
describe the optical distinguishability of the nuclear spin states in each electronic 
manifold. Usually, the nuclear spin state does not affect photoluminescence inten- 
sity, that is, E=7=0, because it does not affect the relaxation process of the NV 
centre. However, close to the ESLAC, dynamics driven by the strong excited-state 
hyperfine coupling renders the relaxation process of the NV spin correlated with 
the nuclear spin state”®, leading to an optical contrast among the nuclear spin 
states. Therefore, the strength of measurement of the nuclear spin can be varied 
by changing the static magnetic field. In our experiments we choose two working 
conditions that, although still allowing nuclear spin polarization (P > 0.9) by optical 
pumping, could highlight differences due to partial measurement of the ancilla 
spin. Consequently, we performed the experiment at two different magnetic fields 
(B=390G and B=514G) characterized by different strengths of the nuclear-spin 
optical contrast (« =0.14, 7=0.16 at B= 390G and e«=0.86, 7 =0.25 at B=514G). 
These values were estimated by measuring the photoluminescence intensity for all 
four states (see Extended Data Fig. 5). 
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Extended Data Figure 1 | Feedback circuit. See, for example, ref. 37 for 
an explanation of the notation. a, b, Measurement-based (a) and coherent 
(b) feedback algorithms. In the shaded regions we highlight the differences 
between the two strategies. The measurement-based protocol requires a 
measurement of the ancilla and subsequent classically controlled operation 
(double lines indicate a classical wire). The coherent feedback protocol 
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does not perform a measurement, but requires a coherent controlled 
operation. c, Re-initializing the ancilla or using multiple fresh ancillas can 
extend the feedback protection beyond the coherence time of the ancilla. 
d, A concatenated feedback algorithm with two ancillas can protect the 
qubit from general noise (applied along any axis). 
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Extended Data Figure 2 | Protected gate. The feedback algorithm is the hyperfine coupling; see equation (1). c, More complex evolution is 
compatible with the application of NOT gates at any point during the obtained when inserting the NOT gate at other times. Here we show the 
protection time. a, Rabi oscillations embedded in the feedback-based behaviour for 7,,=7/4. Black circles are experimental data with error bars 
protection algorithm. b, When the NOT gate is applied in the middle representing their standard deviation (see Fig. 2); the solid lines are fits 
of the protection time, it halves the period of the oscillations due to using equation (1). PL, photoluminescence. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


a Feedback Partial b 
algorithm measurement 

Qubit ---- 

Ancilla ---- 


Extended Data Figure 3 | Weak measurement circuit. a, After the 
feedback algorithm is completed, a controlled-X rotation entangles the 
qubit with the ancilla, allowing us to perform a partial measurement of the 
ancilla: y/)e(@|1)n + {]0)n) > & cos(¢)|y/)el1)n + 8 sin(4) (o;ly).)|0)n. The 
strength of the measurement can be adjusted by the angle ¢. b, The control 
sequence in a is simplified by combining the partial measurement gates 
with the preceding gates of the feedback algorithm. As a result, to 
implement the weak correlated measurement in the experiment, we simply 
perform a controlled-phase rotation gate with 0 = 1/2 — ¢, instead of the 
controlled-Z (@= 7) rotation that is required for the feedback algorithm. 
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Extended Data Figure 4 | Control and coherence of the nitrogen nuclear —_ measured nutation rate (Rabi frequency) is 2, + 26.3 kHz (ref. 20). 


spin. a, Typical control sequence. The first laser pulse initializes the spins d, Nuclear Ramsey fringes under the sequence 1/2-7-1/2, where T is 

into the |0, 1) state. After a nuclear-spin gate (radio-frequency (RF) the free evolution time. We measure a dephasing time of the nuclear spin 
driving), to detect the nuclear spin state we use a microwave (MW)- T3, = 3.2 ms, which is limited by the NV electronic spin lattice relaxation 
selective 7 pulse that maps the nuclear spin state onto that of the NV spin, process (T =4.5 ms; red circles and dashed line). Error bars represent the 
which is subsequently detected by the second laser pulse. b, Nuclear standard deviation of the signal, calculated by error propagation 
magnetic resonance at about 390 G. We sweep the radio frequency at fixed from the photoluminescence intensity of the signal and reference 

pulse length. The dip in the photoluminescence (PL) spectrum indicates photoluminescence curves acquired for each data point for ms = {0, —1}. 
the resonant frequency of the |0, 1) > |0, 0) transition. c, Nuclear Rabi Lines are fits to the data. 


oscillations. We drive resonantly the |0, 1) < |0, 0) transition. The 
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Extended Data Figure 5 | Electronic- and nuclear-spin-dependent 
fluorescence at different magnetic field strengths. At the lower magnetic 
field (left; B= 390 G), the fluorescence intensity shows only a weak 
dependence on the nuclear spin state in the ms = —1 manifold (indicated 
by the overlapping data), whereas at B= 514 G, which is very close to the 
ESLAC, a strong dependence on the nuclear spin state is observed in both 
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manifolds (right; indicated by the non-overlapping data). From these 
fluorescence measurements, we obtained the parameters (¢, 7)) used to 
model the measurement operator. In the experiments, the detection time 
delay and window were optimized to obtain the maximum contrast of the 
state at each magnetic field. 
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Quantum hydrogen-bond symmetrization in the 
superconducting hydrogen sulfide system 


Ion Errea!*, Matteo Calandra, Chris J. Pickard*, Joseph R. Nelson, Richard J. Needs°, Yinwei Li®, Hanyu Liu’, Yunwei Zhang®, 


Yanming Ma® & Francesco Mauri*? 


The quantum nature of the proton can crucially affect the structural 
and physical properties of hydrogen compounds. For example, in 
the high-pressure phases!” of HO, quantum proton fluctuations 
lead to symmetrization of the hydrogen bond and reduce the 
boundary between asymmetric and symmetric structures in the 
phase diagram by 30 gigapascals (ref. 3). Here we show that an 
analogous quantum symmetrization occurs in the recently 
discovered* sulfur hydride superconductor with a superconducting 
transition temperature T. of 203 kelvin at 155 gigapascals—the 
highest T, reported for any superconductor so far. Super- 
conductivity occurs via the formation of a compound with chemical 
formula H;S (sulfur trihydride) with sulfur atoms arranged on 
a body-centred cubic lattice*°. If the hydrogen atoms are treated 
as classical particles, then for pressures greater than about 
175 gigapascals they are predicted to sit exactly halfway between 
two sulfur atoms in a structure with Im3m symmetry. At lower 
pressures, the hydrogen atoms move to an off-centre position, 
forming a short H-S covalent bond and a longer H---S hydrogen 
bond in a structure with R3m symmetry*°. X-ray diffraction 
experiments confirm the H3S stoichiometry and the sulfur lattice 
sites, but were unable to discriminate between the two phases!®. 
Ab initio density-functional-theory calculations show that quantum 
nuclear motion lowers the symmetrization pressure by 
72 gigapascals for H3S and by 60 gigapascals for D3S. Consequently, 
we predict that the Im3m phase dominates the pressure range 
within which the high T. was measured. The observed pressure 
dependence of T. is accurately reproduced in our calculations for 
the Im3m phase, but not for the R3m phase. Therefore, the quantum 
nature of the proton fundamentally changes the superconducting 
phase diagram of H3S. 

The recent discovery of high-temperature superconductivity in 
compressed hydrogen sulfide* has led to a number of theoretical stud- 
ies aimed at understanding the phase diagram of the H-S system and 
the origin of the astonishingly high T. that was observed>*'?"!8. The 
overall consensus is that H2S, the only stable compound formed by 
hydrogen and sulfur at ambient conditions, is metastable at high pres- 
sures and its decomposition gives rise to several H-S compounds. 
High-T- superconductivity is believed to occur in a structure with H3S 
stoichiometry, and is considered to be conventional in nature, that is, 
mediated by electron-phonon interactions**”?"!”, Alternatives to 
conventional superconductivity have also been discussed'’. According 
to structural predictions®°, H3S adopts a rhombohedral R3m form 
between approximately 112 GPa and 175 GPa, anda cubic Im3m form 
at higher pressures. As shown in Fig. 1, the R3m phase is characterized 
by covalently bonded SH; units with a covalent H—S bond of length 
d,. Each of these H atoms is bonded to the next S atom by a hydrogen 


H.---S bond of length d,. By contrast, the Im3m phase has full cubic 
symmetry, with d; = d2 so that each H atom resides midway between 
the two S atoms, as shown in Fig. 1. The R3m structure is nevertheless 
very close to cubic symmetry; for example, the density functional the- 
ory (DFT)-relaxed R3m structure, which represents the minimum of 
the Born—Oppenheimer energy surface (BOES), has a rhombohedral 
angle of 109.49° at approximately 150 GPa, compared to 109.47° for a 
perfect bcc lattice. We verified that imposing a cubic angle on the R3m 
structure has a negligible effect on the energy difference between the 
R3mand Im3m structures. Consequently, we assume a cubic lattice 
for both phases in the following. 

The bond-symmetrizing second-order transition from R3m to 
Im3m occurs at 175 GPa according to our static lattice calculations. At 
this pressure, our harmonic phonon calculations show that a [’-point 
optical phonon of the high-symmetry Im3m phase becomes imagi- 
nary, implying that Im3m is at a saddle point of the BOES between 
112 GPa and 175 GPa, whereas the R3m phase lies at the minimum. 
Crystal symmetry guarantees that the transition is second-order (see 
Methods for a symmetry analysis). As occurs in the high-pressure ice 
X phase*!*”° and other hydrogenated compounds”!, the quantum 
nature of the proton can radically alter the pressure at which the 
second-order phase transition occurs and, in the present case, can 
strongly affect the stability of the R3m phase below 175 GPa. 
Determining the stability ranges of these phases therefore requires the 
inclusion of vibrational zero-point energy (ZPE) along with the static 
BOES energy. However, the presence of imaginary phonon frequencies 


Figure 1 | Crystal structures of the competing phases. Crystal structure 
in the conventional bcc cell of the R3m (left) and Im3m (right) phases. 

In the R3m structure, the H—S covalent bond of length d, is marked with a 
solid line and the longer H---S hydrogen bond of length d, with a dotted 
line. In the Im3m phase, d, = d>. The lattice parameter a is marked in the 
Im3m structure. Blue and pink atoms represent S and H atoms, 
respectively. 
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Figure 2 | Energetics. a-d, Ego, Eyip and E = Eyi, + Ego curves for H3S 

(a, c) and D3S (b, d) are shown as functions of the reaction coordinate Q 
that transforms the Im3m structure (Q=0) into the R3m structure (Q=1), 
as well as the relative coordinate x = (d) — a/2)/(a/2) that measures the 
off-centring of the H atoms, where d) is the length of the hydrogen bond 
and a is the lattice parameter (see Fig. 1). The left-hand (black) axes show 
the energy scale for E,;, and E; the right-hand (red) axes show the scale 

for Ego. Crystal symmetry implies that E(Q) = E(-Q), so that the curves 
can be fitted to polynomials with only even terms; this guarantees that the 


hinders calculations of the ZPE, because the quasi-harmonic approx- 
imation breaks down, and anharmonicity becomes important. 

To elucidate the role of anharmonicity and quantum effects in the 
pressure range in which the record T. = 203 K was observed, we make 
use of the stochastic self-consistent harmonic approximation 
(SSCHA)?. The variational SSCHA method was devised for calcu- 
lating the free energy and phonon spectra while fully incorporating 
quantum and anharmonic effects, and so is perfectly suited for our 
purpose. All of the calculations presented here are performed at 0 K. 
Primitive cells for the R3m and Im3m structures contain four atoms 
(one S atom and three H atoms); consequently, a particular nuclear 
configuration can be described by a 12-dimensional vector R contain- 
ing the atomic coordinates. In the classical limit, the ZPE is neglected 
and the energy of a nuclear configuration R is given by the DFT Born- 
Oppenheimer energy Ego(R). In the SSCHA, the ZPE is accounted 
for by approximating the nuclear wavefunction by a Gaussian centred 
on a centroid coordinate R., which denotes the average and most 
probable position of the nuclei. For a given R,, the width of the 
Gaussian is obtained by variational minimization of the expectation 
value of the sum of the nuclear potential and kinetic energies. In the 
following analysis, it is convenient to split the SSCHA total energy 
E(R,) into static and anharmonic-vibrational-ZPE contributions: 
E(R.) | Ego(R-) =r Evin(R.). 

We study the energy landscape E(R,) along the line defined 
by R.(Q) = Rin3m+ Q(Rr3m _ Rin3m)> in which Rin3m and Rp3m are, 
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transition is second-order according to Landau theory’. Results are 
presented for two different volumes of the primitive bcc lattice; 
V = 97.85a; (a, b) corresponds to approximately 150 GPa and V= 102.114} 
(c, d) to 130 GPa. The pressure associated with each volume depends on 
both the isotope and Q (see Extended Data Fig. 1 for the equations of 
state). Black circles represent calculated E,i, points and the black dashed 
line the fitted E,;,(Q) curve (see Methods). The E(Q) curve is obtained by 
addition of the fitted E,;, and Ego curves. 


respectively, the coordinates corresponding to the saddle point and 
minimum of the BOES, representing the two different symmetries. 
Here, Q is a real number describing the reaction coordinate, so that at 
Q=0 the centroids are located at the atomic positions of Im3m, and 
at Q=1 at the atomic positions of R3m. Hence, Q measures the 
off-centring of the hydrogen nuclear wavefunction and can be associ- 
ated with the relative coordinate x = (d) — a/2)/(a/2) that quantifies 
the length of the H---S hydrogen bond with respect to the symmetric 
position (a is the lattice parameter). We analyse the curve 
E(R.(Q)) = E(Q) for a fixed primitive bcc unit cell. As shown in 
Fig. 2a for a cell volume of 97.8543 (do is the Bohr radius), the Ego(Q) 
curve has a shallow double-well structure that favours the R3m struc- 
ture by only 5.6 meV per H3S. However, after adding the Ey,(Q) energy 
calculated with the SSCHA, the full E(Q) curve shows a clear mini- 
mum at Q=0, which favours the Im3m structure. At a larger volume 
of 102.11a;, which corresponds to a pressure of about 130 GPa, the 
minimum of E(Q) is also at Q=0, despite the fact that the one- 
dimensional Born—-Oppenheimer well in Ego(Q) becomes deeper, as 
shown in Fig. 2c. Upon repeating these calculations for D3S, we find 
that the Im3m structure is the most favourable once the ZPE has been 
included. Therefore, we conclude that the quantum nature of the nuclei 
symmetrizes the hydrogen bond and leads to a proton wavefunction 
that is centred at the atomic positions of Im3m for both H3S and D3. 
To eliminate the possibility that the energy minimum occurs beyond 
the R.(Q) line studied, we performed an unconstrained SSCHA 
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Figure 3 | Second-order phase transition. a, For each volume V we plot the 
relative coordinate x that yields the minimum total energy; x measures the 
off-centring of the H atoms (see Fig. 2). The results are shown in the classical 
nuclei limit (black), as well as in the quantum case both for H3S (green) and 
D3S (purple). The volume at which x departs from zero marks the second- 
order phase transition from the Im3m phase to the R3m phase. Transition 
pressures are also indicated, and include the effects of vibrational energies. 
b, Phase diagram for the second-order phase transition as a function of 
pressure. As shown in ref. 8, below 112 GPa, H3S adopts a very different 
C2/c phase; we mark the expected emergence of this phase by a box. 


minimization, optimizing both the width of the Gaussians and the R, 
centroid positions. The results of this minimization confirm that, 
within stochastic error, the centroid position obtained corresponds to 
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the Im3m structure, in which the H-S covalent and H---S hydrogen 
bond distances equalize, leading to symmetric hydrogen bonds. 

The difference between the vibrational energies of R3m and Im3m 
as a function of the x coordinate is weakly dependent on volume. This 
allows us to interpolate E(x) in a wide volume range and estimate the 
pressure at which the proton wavefunction shifts away from the cen- 
tred position. Our calculations show that this symmetry breaking 
occurs at 103 GPa in H3S and 115 GPa in D3S (see Fig. 3). The 
higher transition pressure in D3S is due to weaker quantum effects. 
This isotope effect is similar to that observed in the ice VII/ice 
X transition™. Considering that, below 112 GPa, the R3m phase is 
expected to transform into a very different monoclinic C2/c phase 
consisting of isolated H2S and H2 molecules with H3S stoichiometry’, 
R3m H3S might not be formed. However, D3S may adopt the R3m 
structure at pressures below the transition to the Im3m phase. 

The quantum proton symmetrization has a very large impact on 
the phonon spectra of H3S. As mentioned previously, and shown in 
Fig. 4, the phonon spectra of Im3m H3S have several imaginary modes 
in the harmonic approximation below 175 GPa. The corresponding 
anharmonic SSCHA phonon spectra for Im3m HS show well-behaved 
phonon dispersion relations with positive frequencies in the pressure 
range of interest (Fig. 4). This behaviour is analogous to ice X, which 
has only real positive phonon frequencies in the regime in which the 
classical limit predicts symmetrization of the hydrogen bond”>*®, The 
anharmonic renormalization of the phonon energies is huge, especially 
for the H-S bond-stretching modes in which H atoms move towards 
the neighbouring S atoms, which are precisely those modes that drive 
the second-order phase transition between the Im3m and R3m phases. 
Therefore, the proximity to the second-order quantum phase transi- 
tion is the origin of the strong anharmonicity. 

Although the bond symmetrization in ice X occurs in an insulating 
system, HS is metallic and the symmetrization strongly affects the 
superconductivity. The calculated electron-phonon coupling and T- 
lend further support to the suggestion that Im3m H3S yields the record 
T-=203 K. We use Wannier-interpolated electron-phonon matrix 
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Figure 4 | Phonon spectra and superconducting transition temperature. 
a, b, Harmonic and SSCHA anharmonic phonon spectra of the cubic high- 
symmetry Im3m structure for H3S at different pressures: 135 GPa (a) and 
157 GPa (b). ¢, T. calculated with the anharmonic phonons for the Im3m 
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structure compared with experimental results obtained after annealing’. 
T. results obtained with anharmonic phonons for the R3m structure below 
175 GPa are also shown. Each pressure takes the vibrational energy into 
account. 
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elements in our calculations” and estimate T. by solving the isotropic 
Migdal-Eliashberg equations. The phonon frequencies and polariza- 
tions that enter the electron-phonon calculations are calculated using 
the SSCHA. Therefore, in the present treatment we neglect nonlinear 
corrections of the electron-phonon vertices and non-adiabatic effects 
that could arise from the small mass of H. The results obtained for the 
Im3m structure using anharmonic phonon frequencies agree well with 
experimental measurements of T- for H3S and D3§, and correctly cap- 
ture the observed increase in T, with decreasing pressure. We also find 
an isotope coefficient a = —{In[T,(D3S)] — In[T.(H3S)]}/In(2) for 
H—D substitution of a=0.35 at 210 GPa and a= 0.40 at 155 GPa in 
good agreement with experiment (see Fig. 4c). The electron-phonon 
coupling constant , which scales with the phonon frequencies as 1/w”, 
is enhanced with decreasing pressure owing to the overall softening 
of the phonon modes. This explains the smooth decrease in T. with 
increasing pressure. Between approximately 130 GPa and 150 GPa, the 
increase in \ is compensated by the decrease in the average phonon 
frequency, and T, saturates. We also present SSCHA calculations for 
the R3m structure keeping the centroids at the Q= 1 position. We find 
a rapid decrease in T- with decreasing pressure as in previous harmonic 
calculations'*. Therefore, the observed high-T; superconductivity can- 
not be explained by HS in the R3m phase, although the sudden 
decrease in T. measured for D3S below 150 GPa (ref. 4) could be an 
indication of the symmetry breaking that we predicted at 115 GPa. 
Indeed, the predicted transition pressure depends on the choice of the 
exchange-correlation functional (see Methods). Even if our choice of 
the PBE (Perdew-Burke-Ernzerhof) exchange-correlation 
functional*° seems appropriate, on the basis of the agreement between 
the experimentally observed equation of state'° and DFT calculations, 
we cannot exclude a small error in the transition pressure. However, 
the isotopic shift in the transition pressure is independent of the func- 
tional. Thus, if the decrease in T. observed experimentally in D3S is 
due to the transition from R3m to Im3m, then we predict that a similar 
decrease also occurs in H3§, but is shifted to lower pressures by about 
12 GPa. Considering that the I m3m phase has no Raman active modes, 
if the drop in T, in D3S coincides with the second-order phase transi- 
tion, then we predict that Raman peaks from the R3m phase would 
emerge at that pressure. 

A recent experiment* suggests that room-temperature supercon- 
ductivity is reachable in other hydrogen-rich compounds. Our results 
show that in such hydrogen-rich materials with prospects for high-T: 
superconductivity, the quantum motion of the proton induces 
non-trivial effects that strongly affect the thermodynamical stability, 
the hydrogen chemical bonding and the electron-phonon coupling 
strength. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Calculation details. Supercell calculations for the SSCHA””? and linear response 
calculations"! were performed within DFT and the generalized gradient approxi- 
mation (GGA) functional*® as implemented in the Quantum ESPRESSO* code. 
We used ultrasoft pseudopotentials’, a plane-wave cut-off energy of 60 Ry for the 
kinetic energy and 600 Ry for the charge density. The charge density and dynamical 
matrices were calculated using a 32 x 32 x 32 Monkhorst-Pack-shifted electron- 
momentum grid for the unit cell calculations. This mesh was adjusted accord- 
ingly in the supercell calculations. The electron-phonon coupling was calculated 
by using electron and phonon momentum grids composed of up to 42 x 42 x 42 
randomly displaced points in the Brillouin zone. The isotropic Migdal-Eliashberg 
equations were solved using 512 Matsubara frequencies and a Coulomb pseudo- 
potential ju" =0.16. 

The SSCHA calculations were performed using a 3 x 3 x 3 supercell for both 
HS and D3$ in the Im3m phase, yielding dynamical matrices on a commensurate 
3 x 3 x 3 q-point grid. The difference between the harmonic and anharmonic 
dynamical matrices in the 3 x 3 x 3 phonon momentum grid was interpolated to 
a6 x 6 x 6 grid. Upon adding the harmonic matrices to the result, the anharmonic 
dynamical matrices were obtained on a 6 x 6 x 6 grid. These dynamical matrices 
were used for the anharmonic electron-phonon coupling calculation. The SSCHA 
calculations for Q= 1 were performed with a 2 x 2 x 2 supercell. For consistency, 
the vibrational energies presented in Fig. 2 were also calculated using a2 x 2 x 2 
supercell. However, the electron-phonon calculations for Q= 1 were performed 
with the SSCHA dynamical matrices interpolated to a 6 x 6 x 6 grid from the 
2x 2x2 mesh. 

The Eyi,(Q) curves in Fig. 2 were obtained as follows. Ey, was calculated for 
Q=0 and Q=1 with the SSCHA. With the SSCHA calculation at Q= 1, we 
extracted Men (Q= 1)with no further computational effort. Considering that the 


derivative of the curve at Q=0 vanishes by symmetry, we obtain straightforwardly 
a potential of the form Ey(Q) = A + BQ? + CQ! (in which A, B and C are 
constants). The Eyip fit curves presented in Fig. 2 were obtained in this way. The 
extra point obtained at Q=0.5 for H3S at V= 97.854, (see Fig. 2a) confirmed the 
validity of the fitting procedure. The Ego(Q) BOES energies were calculated for 
many Q points, yielding an accurate fitting curve. Figure 3 was obtained using a 
polynomial interpolation of the BOES in the volume range shown and adding the 
EAS" (x) — EX™3" curves that are practically independent of volume. 

Symmetry analysis of the second-order phase transition. The bond-symmetri- 
zation transition from R3m to Im3m is a second-order transition and is driven by 
the softening of an optical mode at I’. The mode driving the transition belongs to 
the irreducible representation T},, also denoted as °,, whose dimension is three 
(refs 34, 35). This irreducible representation is compatible with a group-subgroup 
relation between the Im3m and R3m space groups. Because the mode driving the 
transition is at the [ point, the transition occurs without increasing the size of the 
unit cell, which contains four atoms in the primitive cell of the bcc lattice of the 
Im3m structure as well as in the rhombohedral lattice of the R3m phase. 

Ina transition from the m3m (Oy) point group to the 3m (C;,), the most general 
free energy expansion contains only even terms of the order parameters*®. 
Therefore, according to Landau theory!!, the transition must be second-order. Our 
reaction coordinate Q used to describe the second-order transition is a simplified 
order parameter. The fact that E(Q) = E(-Q) confirms that the transition is 
second-order. 

Equations of state. In Extended Data Fig. 1 we present the equation of state both 
for the Im3m and R3m phases. The pressure is calculated both with and without 
the vibrational contributions to the energies of H3S and D3S. The curves that 
include the vibrational contribution to the energy are used to determine the vibra- 
tional effects on the calculated pressures. The vibrational energy calculated 
includes anharmonicity in all cases as calculated by the SSCHA. The vibrational 
energy has a small contribution to the calculated pressure, which varies slightly 
with the isotopic mass. The pressure-volume curve can be efficiently fitted to the 
third-order Birch-Murnaghan equation. The parameters of the fit are given in 
Extended Data Table 1. 

Dependence on the exchange-correlation functional. All calculations presented 
here are calculated within the GGA under the PBE parameterization*”. However, 
we performed additional calculations within the local density approximation 
(LDA)*” and the BLYP (Becke-Lee-Yang-Parr)***? parameterization of the GGA. 
The second-order phase transition at the static level from Im3m to R3m, which 
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occurs at 175 GPa within PBE, is reduced to 145 GPa in the LDA and increased to 
257 GPa with the BLYP parameterization. In Extended Data Fig. 2 we compare the 
relative coordinate x = (dj — a/2)/(a/2) as a function of volume, where d is the 
length of the hydrogen bond and a is the lattice parameter. The x coordinate is 
obtained both with the static BOES energy Ego and with the total energy 
E= Exo + Evip. However, the vibrational energy is assumed to be independent of 
the functional, and the value obtained with PBE is used for estimating Ey, for both 
LDA and BLYP. We believe that this assumption is justified because the differences 
between the vibrational energies of Im3m and R3m calculated within the SSCHA 
and using PBE are very weakly volume dependent, as shown in Extended Data 
Fig. 3. At the static level, the LDA symmetrizes the structure at a pressure 30 GPa 
below that predicted by PBE, whereas BLYP symmetrizes it at a pressure 82 GPa 
above that predicted by PBE. When including the vibrational contribution as 
described above, the transition between Im3m and R3m occurs at 170 GPa for H3S 
and 185 GPa for D3S within BLYP. The shift in the transition pressure induced by 
the ZPE and its isotope dependence is similar for PBE and BLYP. The fully sym- 
metric Im3m H3S structure is always favoured within the LDA for H;S over the 
pressure range studied, and it only becomes unfavourable below 89 GPa for D3S. 

Although the choice of exchange-correlation functional adds some uncer- 
tainty to the predicted transition pressure, we believe PBE is the most appro- 
priate choice. This statement is based on the fact that PBE best reproduces the 
experimental equation of state results of ref. 10. BLYP overestimates the volume, 
especially when the vibrational contribution is considered (see Extended Data 
Fig. 4). Indeed, it is well known that BLYP vastly overestimates the equilibrium 
volumes in metals“° and, therefore, it is not the best choice for H3S. On the 
contrary, when including the vibrational contribution to the pressure, which is 
calculated in all cases using the PBE functional, the PBE equation of state is in 
rather good agreement with experimental results’ and certainly performs better 
than LDA and BLYP. 

Phonons of Im3m D;S. In Extended Data Fig. 5 we show the harmonic and anhar- 
monic phonon spectra of D;S in the Im3m phase at two different pressures. 
Anharmonic phonon dispersion in the R3m phase. In Extended Data Fig. 6 we 
show the SSCHA phonon spectra calculated for the R3m phase at two different 
pressures. Here the centroid position of the SSCHA is placed at the minimum of 
the BOES, which is not a minimum of the total energy, as discussed. The Raman 
active modes of the R3m phase are listed in Extended Data Table 2. 
Superconducting properties. In Extended Data Table 3 we summarize the calcu- 
lated values of the electron-phonon coupling constant A, logarithmic frequency 
average Wog and T.. 

In Extended Data Fig. 7 we show the Eliashberg function a7F(w) and integrated 
electron-phonon coupling constant \(w) of the Im3m phase at two different pres- 
sures calculated using the SSCHA method. The Eliashberg function a*F(w) in the 
anharmonic case for the R3m structure is given in Extended Data Fig. 6. 
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Extended Data Figure 1 | Equations of state. Pressure P as a function of volume V for the Im3m (left) and R3m (right) phases, calculated from the static 
energy Ego that represents the classical nuclei limit (black), and including the vibrational contribution within the SSCHA for both H3S (red) and D3S (green). 
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Extended Data Figure 2 | Bond symmetrization within LDA (left) and 
BLYP (right). For each volume V the relative coordinate x = (dz — a/2)/(a/2), 


in which d, is the length of the hydrogen bond and a is the lattice 
parameter, obtained at the energy minimum is given. When x =0, the 


covalent and hydrogen bonds have the same length and the structure is 
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fully symmetric. The energy is calculated at the static level without any 
vibrational contribution as derived from the BOES (black), and including 
the quantum anharmonic vibrational contribution both for H3S (green) 
and D3;S (purple). The pressure P below which the cubic structure distorts 
is given in each case. 
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Extended Data Figure 3 | Vibrational energy. SSCHA anharmonic vibrational energy (Eyi,"(x) — ne ™) calculated as a function of the relative 
coordinate x = (d2 — a/2)/(a/2) for different volumes (see legend). 
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Extended Data Figure 4 | Dependence of the equation of state on the of the structure with minimum energy is given, which depends on whether 
density functional. The equation of state is calculated with different or not the vibrational contribution is included (see Extended Data Fig. 2). 
exchange correlation functionals with (right) and without (left) the The results are compared with the two curves obtained experimentally’®. 


vibrational contribution to the pressure. At each volume V the pressure P 
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Extended Data Figure 5 | Phonons of Im3m D3S. Comparison between the harmonic and anharmonic phonons of Im3m at two different pressures for 


D3S: 133 GPa (left) and 155 GPa (right). 
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Extended Data Figure 6 | Anharmonic phonons of R3m H;S. Anharmonic phonons are shown at two different pressures: 153 GPa (left) and 133 GPa 
(right). The Eliashberg functions a?F(w) are also shown. 
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Extended Data Figure 7 | Superconducting properties of Im3m HS. Anharmonic Eliashberg function a*F(w) (solid lines) and integrated electron- 
phonon coupling constant \(w) (dashed lines) of the Im3m phase as functions of frequency w at two different pressures: 135 GPa (black) and 157 GPa (red). 
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Extended Data Table 1 | Birch-Murnaghan fit to the equation of state 
Bo (GPa) Vo (a8) By 


Classical nuclei 129.8 158.4 3.6 

H3S 86.63 176.4 3.9 

D3S 87.81 174.8 3.9 
Parameters of the fit to P(V) = 380 (Vo/Vy'/3—(Vo/ vye/sifa +2060! ~4)(Vo/v)2/3 — i} for the 


equation of state of the /m3m phase with classical nuclei, and with vibrational contributions for 
both H3S and D3S. Bo, bulk modulus at zero pressure; Vo, reference volume; Bo’, pressure 
derivative of the bulk modulus at zero pressure. 
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Extended Data Table 2 | Raman and infrared active modes 


Im3m 
Mode Degeneracy  R or | active? w (meV) 
H3S D3S 
135 GPa 157 GPa 133 GPa 155 GPa 
Tay 3 I 79.6 87.6 46.7 54.1 
Ti 3 I 131.0 152.1 95.4 109.8 
Tou 3 177.3 182.0 125.8 129.0 
R3m 
Mode Degeneracy Ror | active? w (meV) 
H3S D3S 
133 GPa 153 GPa 133 GPa 153 GPa 
E 2 R+1 111.3 112.2 74.4 73.7 
Ai 1 R+1 115.4 114.7 79.8 rewa 
E 2 R+1 142.4 144.6 105.6 105.9 
Al 1 R+1 168.7 158.8 123.2 115.0 
A2 1. 178.0 185.3 125.1 130.7 
E 2 R+1 181.0 186.7 127.0 131.7 


Phonon modes at the F point for the /m3m and R3m phases at different pressures calculated including anharmonicity within the 
SSCHA. The results for R3m are obtained keeping the centroid position at the minimum of the BOES. Raman (R) and infrared (I) 
activity is indicated for each mode. 
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Extended Data Table 3 | Superconducting parameters 


Compound _ P (GPa) IN Whog(meV) Tc (K) 

SSCHA Im3m 

H3S 135 2.10 104.9 214 

H3S 157 1.94 113.8 216 

D3S 133 2.45 70.5 163 

D3S 155 2.08 82.9 163 
SSCHA R3m 

H3S 133 1.62 105.8 163 

H3S 153 1.93 109.3 205 

D3S 132 1.64 79.8 123 

D3S 152 2.00 81.4 155 


Calculated A, wiog and T.. The pressure given includes the vibrational contribution. 
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Continuous directional water transport on the 
peristome surface of Nepenthes alata 


Huawei Chen!*, Pengfei Zhang, Liwen Zhang', Hongliang Liu’, Ying Jiang’, Deyuan Zhang!, Zhiwu Han‘ & Lei Jiang? 


Numerous natural systems contain surfaces or threads that enable 
directional water transport'~’. This behaviour is usually ascribed 
to hierarchical structural features at the microscale and nanoscale, 
with gradients in surface energy®® and gradients in Laplace 
pressure!” thought to be the main driving forces. Here we study 
the prey-trapping pitcher organs of the carnivorous plant Nepenthes 
alata. We find that continuous, directional water transport occurs 
on the surface of the ‘peristome’—the rim of the pitcher—because 
of its multiscale structure, which optimizes and enhances capillary 
rise!) !? in the transport direction, and prevents backflow by 
pinning in place any water front that is moving in the reverse 
direction. This results not only in unidirectional flow despite the 


Peristome 
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Figure 1 | In situ optical observation of directional water transport on 
the peristome surface. a, Optical images of a pitcher of N. alata, showing 
a conspicuous prey-trapping peristome (left) and a cross-sectional image 
of the peristome (right). The pitcher is 10-15 cm long, with the peristome 
at its upper rim. This peristome is arch-shaped, and the distance from the 
inner to the outer side is approximately 1-2 cm. b, A water droplet (mixed 
with blue ink) is dropped onto the peristome with a microlitre syringe; 
this droplet can overcome gravity to be directionally transported from 
the inner side to the outer side (but not in the opposite direction) within 


absence of any surface-energy gradient, but also in a transport 
speed that is much higher than previously thought. We anticipate 
that the basic ‘design’ principles underlying this behaviour could 
be used to develop artificial fluid-transport systems with practical 
applications. 

The tropical plant N. alata has a ‘pitcher’—a highly modified, 
prey-trapping organ, filled with digestive fluid—that is 10-15 
centimetres long; at the upper rim is an arch-shaped ring of tissue 
known as the peristome, which is about 1-2 centimetres wide from the 
inner to the outer side (Fig. 1a). The pitcher can digest insects to meet 
fundamental nutrient needs!*, after capturing these insects when they 


‘aquaplane’!’ on the peristome surface, which is completely wettable 


several seconds. The red arrows show the direction of water transport. 

c, Water transport from the inner side to the outer side is confined 
within a single large channel. The starting boundary line (marked by 
white rectangles) of deposited water does not change during transport. 
d, This enlarged partial image of the water boundary indicates that water 
transport is further confined within approximately ten microchannels, 
distributed within the large channel. Each microchannel consists of 
arch-shaped microcavities with a spacing of approximately 100 1m. The 
black parts in b-d mark the water droplet. 


1§chool of Mechanical Engineering and Automation, Beihang University, Beijing 100191, China. @Laboratory of Bio-inspired Smart Interface Science, Technical Institute of Physics and Chemistry, 
Chinese Academy of Sciences, Beijing 100190, China. ?School of Chemistry and Environment, Beihang University, Beijing 100191, China. “Key Laboratory for Bionic Engineering, Ministry of 


Education, Jilin University, Changchun 130022, China. 
*These authors contributed equally to this work. 
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Figure 2 | Surface features of the peristome in the pitchers of N. 

alata. a, The peristome surface exhibits a unique structural feature of 
two-order microgrooves. The width, L;, of the first-order microgroove 
is 461.72 + 49.93 um. The white arrow indicates the direction of the 
inner side of the peristome. b, Each first-order microgroove consists of 
approximately ten second-order microgrooves, each containing periodic 
duck-billed microcavities with arch-shaped edges. The width, L2, of the 
second-order microgroove at the valley of the first-order microgroove is 
50.18 + 6.18 1m. The numbers 1-4 correspond to the same numbers in 
panel d. c, A vertical section taken at the A-A position in panel b reveals 
that the duck-billed microcavity is closed at the top, with a sharp edge, 


by water from rain, dew or high humidity and by nectar secreted from 
extrafloral nectaries at the pitcher’s inner margin’®”. Slippery surfaces 
have been designed to mimic the structure and behaviour of the natural 
peristome surface!*!°, but questions remain about the mechanism(s) 
underlying its function. 

Here, we use high-speed camera observations to monitor water 
movement along the peristome surface (Fig. 1b). We find directional 
transport from the inner to the outer side: the water of a droplet depos- 
ited on the inner peristome margin moves to the outer margin within 
several seconds, while a water droplet placed on the outer margin 
cannot move towards the inside. In situ optical microscopic observa- 
tions (Fig. 1c) show that water transport is confined within a single 
large channel running perpendicular to the peristome rim; the water 
does not spread laterally beyond the width of the initial deposition 
area (marked by white rectangles in Fig. 1c; see also Supplementary 
Video 1; for water transport along several parallel large channels, see 
Supplementary Video 2). The higher-resolution image in Fig. 1d shows 
that water transport is in fact confined to approximately ten micro- 
channels distributed within the large channel, and that each micro- 
channel consists of arch-shaped microcavities with a spacing of roughly 
100 1m. These structural features ensure that water evaporating from 
the pitcher and condensing on the inner margin will spread over the 
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and has a slightly slanted outer surface. Neighbouring microcavities 
overlap (the approximate starting point of the overlap is marked by a white 
arrow). d, Each duck-billed microcavity possesses sloping (‘gradient’) 
wedge-shaped corners, marked by green arrows; gradient opening angles, 
3, are marked by dashed green lines. The overlap between neighbouring 
microcavities, marked by red arrows, becomes evident along the ridge. 

e, The opening angles of the wedge-shaped corners show that the 
gradient declines from roughly 90° to approximately 28° on the top of the 
microcavity (the error bars indicate standard deviations from at least five 
independent measurements). The insert image shows the height of the 
microcavity. 


entire peristome surface to make it more slippery, and are also likely to 
cause the directionality of the water transport. 

Detailed structural characterization of the peristome surface with a 
scanning electron microscope (SEM) reveals a two-order hierarchical 
arrangement of regular radial ridges, aligned so that they form a two- 
order hierarchy of parallel microgrooves (Fig. 2a; see Extended Data 
Fig. 1a for a cross-section). Each first-order microgroove—with a 
width, L), of 461.72 + 49.93 um (Fig. 2a)—contains about ten second- 
order microgrooves whose width, L2, increases from the ridge to the 
valley of the first-order microgroove (reaching 50.18 + 6.18 1m at 
the valley). Arch-shaped (or duck-billed) microcavities are regularly 
distributed along the second-order microgrooves (Figs 1d and 2b), with 
an overall upward slant and with the tops of the arches pointing towards 
the outer side. A vertical section through the peristome shows that 
the top sections of the microcavities are closed (Fig. 2c; see Extended 
Data Fig. 1b for side view), and that the enclosing surface is slightly 
slanted and has a sharp edge. The periodic microcavities overlap along 
the ridge (the starting point of which is marked by a white arrow in 
Fig. 2c), with the overlap between a given microcavity and its overlying 
neighbour becoming more evident on moving towards the top of the 
microcavity (red arrows in Fig. 2d). The sides of each cavity form a 
wedge (green arrows in Fig. 2d), with the opening angle, (3, declining 
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Figure 3 | Directional water-transport processes 
on the peristome surface. a, High-speed digital 
images showing how the starting boundary of 

a water droplet (dashed lines) changes after the 
droplet is deposited on the peristome surface. 
The final boundary coincides with the outline of 
the microcavity, indicating that water spreading 
was halted by the microcavity’s sharp edge. Water 
spreading from the outer to the inner side of 

air the peristome is stopped by being pinned at the 
sharp edge. b, High-speed digital images of the 
continuous water-transport process between 
overlapping duck-billed microcavities. As the 
water fills one microcavity, some water overflows 
to begin filling the next. The dashed lines show 
the boundary of water transport. c, Three- 
dimensional illustration of the water-transport 
process. The lower water layer, layer III, fills up a 
single microcavity (yellow arrows) and overflows 
to generate the upper water layer, layer II. Before 
the second microcavity is completely filled, the 
upper water layer (brown arrows) becomes the 
top water layer, layer I (green arrows), filling 

the third microcavity. Transport in the reverse 
direction does not occur because the water 

is pinned in place, with the water boundary 
coinciding with the sharp edge. 


from an initial value of approximately 90° at the bottom of the micro- 
cavity to roughly 28° at the top (Fig. 2e and Extended Data Fig. 2). 

In situ observation of the water-spreading process along a single 
second-order microgroove shows that the water contact line (indi- 
cated by dashed lines in Fig. 3a) after deposition coincides with the 
outline of the microcavity, indicating that water is pinned at the sharp 
edge of the microcavity. The edge angle, ¢ (the angle subtended by 
the two surfaces that form the sharp edge), of approximately 2° to 
8° favours water pinning, according to the Gibbs inequality~°, and 
prevents wetting from proceeding from the outer to the inner side 
of the peristome (Extended Data Fig. 1b; see Methods for further 
discussion). In contrast, water transport from the inner to the outer 
side occurs simply through continuous filling of a single microcavity. 
The outline of the water boundary (indicated by the dashed lines in 
Fig. 3b) shows that water initially spreads along the corner of the 
wedge, then pushes air out to fill the microcavity, and finally con- 
verges at the front of the microcavity (Fig. 3b and Supplementary 
Video 3; see Extended Data Fig. 3 for a complete illustration of water 
filling a single microcavity). Because neighbouring microcavities 
overlap, the filling of one microcavity begins before that of the preced- 
ing microcavity is completed. 

The overall result is continuous and directional water transport, as 
illustrated in Fig. 3c: when a given microcavity fills to the extent that 


its water level exceeds the top of its confining side wall, around the 
starting point of the overlap with the next microcavity (white arrow in 
Fig. 2c), the lower water layer (layer III; yellow arrows) overflows and 
generates the upper water layer (layer II), which then begins to fill the 
second microcavity (brown arrows). Before the second microcavity is 
full, the cycle repeats and the upper water layer (layer II) overflows to 
generate the top water layer (layer I; green arrow), which fills the third 
microcavity. Sequential filling of each microcavity achieves continuous 
water transport, while water pinning prevents wetting from occurring 
in the reverse direction. 

Wettability measurements show that the peristome surface is 
hydrophilic, and energy-dispersive spectrometer (EDS) measure- 
ments indicate the lack of a chemical gradient (Extended Data Fig. 4). 
To explore the role of surface energy further, we made artificial 
peristomes using a polydimethylsiloxane (PDMS) replica moulding 
method”! (Extended Data Fig. 5) and adjusted their surface energy 
using oxygen plasma treatment’. We detected directional water 
transport only with PDMS surfaces that are hydrophilic (Fig. 4a), with 
an intrinsic contact angle between the water and the surface of less 
than about 65°, which constitutes the newly proposed boundary”*4 
between hydrophobicity and hydrophilicity (the previously known 
contact-angle boundary was 90°). The artificial PDMS peristome 
with a superhydrophilic surface displayed the fastest water-spreading 
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Figure 4 | Mechanism of directional water transport. a, Directional 
water transport occurred on the artificial PDMS replica only when 

the PDMS was hydrophilic (that is, when the intrinsic contact angle 
between the water and the PDMS molecules was less than 65°); the water- 
transport speed was maximized when the surface was modified to be 
superhydrophilic (such that the intrinsic contact angle was less than 5°). 
Error bars indicate standard deviations from at least five independent 


speed, close to the transport speed measured for the natural peri- 
stome surface (approximately 78 + 12mms-!) (Fig. 4a). Figure 4b, c 
illustrates the transport experiments. No water transport occurred 
on the hydrophobic PDMS surface when water was placed on either 
the inner side or the outer side (Fig. 4b and Extended Data Fig. 6c, d). 
However, when the artificial surface was superhydrophilic, directional 
water transport occurred (Fig. 4c and Extended Data Fig. 6e, f) much 
as it does as on the natural peristome (Extended Data Fig. 6a, b). 
Superhydrophilicity is thus crucial for directional water transport 
in this system, and ensures that transport occurs on both the natural 
and the artificial peristome surface much more quickly than was seen 
in an earlier study documenting unidirectional wetting caused by an 
asymmetric surface structure”>. 

Superhydrophilic surfaces generally have no capacity for directional 
water movement unless they contain a chemical gradient*”. In the case 
of the peristome surface, which lacks such a gradient, the directional 
water transport arises from its unique structural features, which enable 
water to fill microcavities because of capillary rise at corners, as first 
described in 1712 (refs 11, 12). Specifically, this type of capillary rise 
occurs at the corner formed by two vertical intersecting hydrophilic 
plates corresponding to the wedge corner of the microcavity shown 
in Fig. 2c (Extended Data Fig. 7a, right), which the liquid can fill and 
use to rise unbounded provided that the Young contact angle, 0, of the 
liquid on the plate and the corner opening angle, a, satisfy a/2 +9 <90° 
(refs 26, 27). The capillary rise height, H.(x), is in this case given by the 
following equation’**°: 


__ 2ycosd 
pagx 


H,(x) 


where 9, p, 6 and g are the surface tension, the density of the liquid, 
the water contact angle and the gravitational constant, respectively. 
However, nature does not use simple corner capillary rise with a 
constant opening angle, but renders it more powerful by structuring 
the peristome so that there is a gradient: the opening angle a at the 
bottom of a given microcavity is larger than the opening angle a, at 
its top (Extended Data Fig. 2). For such a case of capillary rise, when 
1 > Q2 (Extended Data Fig. 7b, right), and assuming that the difference 
between a and 2 is infinitesimal (see Methods), the capillary height 
rise H.(x) can be presented as: 

ay— a 4y?cos76 
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where h is the height of the two intersecting plates. Comparison of 
the two equations shows that the capillary rise height in the case of a 


88 | NATURE | VOL 532 | 7 APRIL 2016 


70 80 


measurements. b, Water could not be transported on the hydrophobic 
PDMS artificial surface, from either the inner side to the outer side (left) 
or the outer side to the inner side (right). c, Water could be transported 
from the inner side to the outer side (left), but not from the outer side 

to the inner side (right), on the superhydrophilic artificial surface. The 
images in b and c show the final state after about 7 seconds (see Extended 
Data Fig. 6 for details). 


constant opening angle is obtained by setting a = a3, and that the cap- 
illary rise height is larger (or smaller) in the case of a > a2 (or ay < a2). 

Simple experiments (Extended Data Fig. 7) confirm the enhance- 
ment and reduction in water rise caused by the gradient of the opening 
angle and the opposite gradient. Interestingly, in a system in which the 
vertical corner leads to a top with a closed horizontal corner, water 
ascends along the vertical gradient and then continues to fill the top 
horizontal corner. This effect increases the volume of water that rises 
in the system by about 40% (Extended Data Fig. 8a), and prolongs 
the water-retention time to twice that seen in the system with a fixed 
opening angle (Extended Data Fig. 8b). In the peristome, the use of a 
microcavity structure that is symmetrical and closed at the top taps 
into these effects, and enables a much stronger water rise than would 
be possible by ordinary capillary rise. This, in conjunction with water 
pinning at the sharp edges of the microcavity in the reverse direction, 
enables continuous and fast directional transport of water along the 
peristome surface. Artificial systems exploiting these principles might 
find use in applications that call for directional fluid transport—for 
example, in agricultural drip irrigation or in the non-powered delivery 
of microdrugs. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Materials. Mature pitcher plants, N. alata, growing in a greenhouse were purchased 
commercially from Kunjiyuanyi Corporation (Guanddong province, China) and 
carefully maintained, covered with plastic film, in a small greenhouse under con- 
trolled conditions at a temperature of 20°C to 25°C and humidity of 75% to 90%. 
Characterization of the peristome surface. We recorded optical images of the 
pitcher of N. alata using a digital camera (600D, Canon, Japan). We observed the 
microstructures of the peristome surface using a SEM (CamScan-3400, CamScan 
Corp., UK) operating at a 20kV acceleration voltage. To avoid severe distortion 
caused by dehydration, we used a freeze-drying method to prepare the SEM spec- 
imens. Wettability was investigated by depositing a water droplet (approximately 
411) on the peristome surface, and the water-spreading process was monitored 
using an optical angle measuring system (SL200B, Solon, China) (Extended Data 
Fig. 4a). The chemical composition of the peristome surface was determined by 
EDS equipped with the SEM (Extended Data Fig. 4b). 

Observation of water transport. We used a high-speed camera (I-speed LT, 
Olympus, Japan) to record the water-transport process. For our initial observations 
of the macroscopic water-transport process (Fig. 1b), we first deposited a water 
droplet (mixed with blue ink, roughly 511 in total) on the inner margin of the per- 
istome surface using a microlitre syringe; then, we cut a partial peristome out of a 
pitcher and placed it under the eyepiece of an optical microscope (BX51, Olympus, 
Japan). For the observations reported in Fig. 1c, d and Supplementary Video 1, we 
used a microsyringe to deposit a small amount of water (about 0.111) on the inner 
surface of the peristome surface, and investigated the water-spreading process 
both in the microgrooves and in the duck-billed microcavity. When observing 
water spreading in two separate large channels (see Supplementary Video 2), we 
deposited about 0.2 11 of water. 

To observe the water-spreading process along a single second-order micro- 

groove (see Supplementary Video 3, Fig. 3b and Extended Data Fig. 3), we depos- 
ited approximately 0.01 jul of water on the inner margin of the peristome and 
monitored the top boundary of the water. To observe water pinning at the sharp 
edge of the microcavity, we deposited roughly 0.01 11 of water on the peristome 
surface and monitored the starting boundary of the water with the high-speed 
camera (Fig. 3a). The water contact line was identical to the sharp edge of the 
microcavity, indicating that water was pinned by the sharp edge running in the 
direction from the outer to the inner side of the peristome. To investigate direc- 
tional water transport on the peristome and its artificial surfaces, we placed the 
samples on a polyethylene plate and then deposited water on one side. Sequential 
images of water transport are shown in Extended Data Fig. 6. 
Fabrication of, and water spreading on, artificial replicas. We used a simple 
replica moulding method to fabricate artificial peristomes on an intact natural 
pitcher using polydimethylsiloxane (PDMS). SEM images (Extended Data Fig. 5) 
showed that all of the surface structural features of the peristome were accurately 
transferred to the artificial replica. The prepolymer was formed by mixing PDMS 
and curing agent at a mass ratio of 10/1. Before the prepolymer was cast onto the 
surface of fresh slippery zone, it was well stirred for 15 min, and then degassed in 
a vacuum oven for 15 min. The negative replicas of the pitcher were fabricated by 
conducting the following procedures: casting, curing in a vacuum oven, demould- 
ing. Then the negative replicas were treated by CF, plasma to form an anti-sticker 
layer. A second replication of PDMS was performed on the treated negative replicas 
in the same way as before, to obtain the replicas of the peristome. (For further 
details of sample preparation and fabrication, see ref. 21.) 

Modification of the PDMS surface by oxygen was conducted using RF plasma 

(P8C, Schwarze, China) at an RF power of 100 W for 5 min, a system pressure of 
100,:bar and a flow rate of 20s.c.c.m. After oxygen treatment, the PDMS was supe- 
rhydrophilic, and the contact angle gradually increased with time in air. Both the 
flat PDMS and the PDMS peristome replicas were fabricated and treated under the 
same conditions. The change in the contact angle of flat PDMS with time was meas- 
ured using an optical angle-measuring system (SL200B, Solon, China). Meanwhile, 
the water-spreading speed on the PDMS replicas (approximately 5,11 water was 
deposited on the inner side of the replicas) was monitored using the high-speed 
camera. Thus, we could obtain the relationship between the water-spreading speed 
on the PDMS replicas and the intrinsic contact angle of PDMS (Fig. 4a). 
Water rise along four types of wedge corners. Polyvinylchloride (PVC) plates 
were used in parallel experiments and were modified to be superhydrophilic by 
oxygen plasma treatment with an RF power of 100 W for 5 min, a system pressure 
of 100,tbar and a flow rate of 20s.c.c.m. The heights of the plates were limited to 
100 mm, owing to restricted sizes of the available material. Four types of wedge 
corners were built (Extended Data Fig. 7): (i) a fixed wedge with a bottom open- 
ing angle a equivalent to the top opening angle a; (ii) a gradient wedge where 
Qy > a2 and a is constant at 5°; (iii) a wedge of opposite gradient with a; < a2 and 
Q is constant at 90°; and (iv) a top-closed wedge with a blind top, that is, a2 =0°. 
The wedge corners were vertically immersed in water mixed with blue ink. 


Water-rise volume and water-retention time. To measure the water volume in 
the four types of wedge corner, we placed the PVC plates vertically in a small 
dish that perfectly fitted their size. We injected water into the dish to a depth of 
approximately 3 mm, and placed a round flat floater on the water. When water 
rose at the wedge corners, the height of the water surface changed. We used a laser 
displacement sensor with an accuracy of 1 |1m to measure the change in height 
of the water surface, by detecting the distance between the floater and the laser 
displacement sensor. The relationship between the change in height and the water- 
rise volume was calibrated by injecting various volumes of water into the dish 
and measuring the change in height of the floater. The accuracy of the water rise 
in this experiment was about 8j11;1m~!, and each case was measured five times 
(Extended Data Fig. 8a). 

To measure the time of water volatilization in the four types of wedge corners, 

we placed the PVC plates vertically in a ventilated heating furnace, and installed 
a digital camera to observe the amount of water. All of the wedge corners were 
injected with 5 ml of deionized water at the bottom of the corner. The temperature 
of the heating furnace was set to 45°C. We defined the volatilization time as the 
water-retention time of the wedge corner (Extended Data Fig. 8b). 
Mechanism of pinning. Water can be transported from the inner to the outer side 
of the peristome, but not from the outer to the inner side. This directional pinning 
probably occurs because of the cooperative effects of the sharp edge (Extended 
Data Fig. 1b) and the opposite-gradient wedge corners of the microcavity. More 
specifically, the upper cover of the microcavity has a sharp edge with an angle, ¢, 
of approximately 2° to 8°; and the peristome surface is superhydrophilic, with an 
equilibrium contact angle of about 0°. When the liquid wetting the edge satisfies 
the condition of Gibbs inequality”°: 


O <0 < (180° — d) + % 


(where @ is the contact angle measured through the drop at the edge), the edge 
will pin the liquid in place. The critical angle of 6 at the moment that the liquid 
contact line just crosses the edge can be calculated to be roughly 172° to 178°. 
We can see that when @ is smaller than this, the liquid will be pinned by the edge 
of the microcavity. The value of @ increases as the volume of liquid increases. 
However, the liquid on the peristome surface can be quickly transported away 
from the deposited position, and thus it is difficult for 0 to reach 172° to 178°; 
hence, the liquid can be pinned by the edge of the microcavity. When the liquid 
contact line crosses the edge, the liquid will fill the microcavity. Meanwhile, the 
opposite-gradient wedge corner can reduce the liquid rise from the outer side 
to the inner side. 

Theoretical analysis of capillary rise in a gradient wedge. When a wedge-shaped 
gap between intersecting plates is brought into contact with liquid, the liquid can 
rise along the intersecting corner (Extended Data Fig. 9a). In our system, the height 
of the liquid surface is determined only by the balance between the interfacial 
potential (Ginterface) and the gravitational potential (Ggravity). Then the total energy 
(Gtotat) of the whole system studied here is: 


Ghotal = Ginterface + Ggravity 


We choose the section of the two intersecting plates with an infinitely small thick- 
ness Ax to analyse the energy potential (Extended Data Fig. 9b). 
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Gis the gradient opening angle. Assuming (3, © 32 90° for simplicity, we get: 
Ginterface= 2H(x) x (—ycos@) 


where 7, 6 and H(x) represent the surface tension, the contact angle, and the height 
of the liquid surface at position x, respectively. 

Gravitational potential 
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(where p, g and h denote the liquid density, the gravitational acceleration, and 
the height of the intersecting plates, respectively), then the total energy (Giotal) is: 
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The height of the surface at equilibrium can be readily determined by minimiz- 
ing Gior with respect to H(x). It leads to: 


_ 2yc0s8 | ay — a 


Hx) He (x) (2) 


PZxary ayh 
And then, we can theoretically compare the liquid rise height in a fixed wedge with 
that in a gradient wedge, as follows. 
For liquid rise in a fixed wedge (that is, when a] = a2=«), we can obtain the 
liquid rise height: 


om 2ycosO 


Hi. 3 
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For liquid in a gradient wedge (when a; > a2), then, when the difference 
between a and ay approaches zero (so aj—a2— 0), we can approximately obtain 
the H.(x) by adopting the first-order correction to equation (2): 


2ycos? — ay — a2 4y7c0s76 
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Because a > a2, the second term at the right hand of equation (4) must be 
larger than zero. So, comparing equations (3) and (4), we can arrive at the 
relationship: 
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He(x)| ar>a2 > He(*)| ara 


For liquid rise in an opposite wedge, where a; < a2, we can directly get the 
relationship: 


He(x)| ar<a2 < He(*)| ai=ar 


On the basis of this theoretical analysis, we demonstrate that the liquid rise height 
at the gradient wedge corner (where a > a2) is larger than that at the fixed wedge 
corner (where a|= 2). The results indicate the gradient wedge corner in the 
microcavity helps the cavity to fill with water and helps with the continuing water 
transport. 

In fact, we can also quantitatively analyse the relative height of the liquid surface 
for a, # a2 compared with the case in which a; =a) in a simple way, as follows. 
In essence, the two components of the total energy of this system play competitive 
roles in determining the height of the surface: Ginterface Will raise He(x) while Ggravity 
will lower it. If we look at the following term of equation (1), that is, at 
* ay 3(x), then we can see that, if a, > a», this term will reduce the Ggravity 
thus: 


F(x)| asa > He(x)| a=ar 
But if a }< a4, this term will increase the Gravity, hence: 


H.(x)| ay<az < H.(x)| Q1=02° 
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Extended Data Figure 1 | The peristome surface and the microcavity. vertical section of a microcavity. The image reveals the microcavity’s 

a, SEM image of a cross-section of the peristome, showing that two-order, geometry in the plane perpendicular to the peristome surface, and shows 
regular radial ridges align perfectly to form regular two-order parallel that the microcavity has a sharp edge with an angle, ¢, of approximately 
hierarchical microgrooves. b, SEM image showing a side view of the 2° to 8°. 
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Extended Data Figure 2 | Cross-sections along the wedge corner of a single microcavity. a—h, Sequential cross-sectional images of the microcavity 
(moving deeper and deeper into the microcavity) show that it has a gradient wedge corner, with an opening angle that varies from roughly 90.5° to 28.2°. 
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Extended Data Figure 3 | How a single microcavity fills with water. and squeezes air out of the microcavity, rising into the cavity to fill it 
a, High-speed digital images of water filling a single duck-billed completely. b, The water-filling process; blue lines correspond to the water 
microcavity show that the water rapidly rises along the wedge corner boundary change with time. 
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Measuring points 


Extended Data Figure 4 | Wettability and chemical composition of the outer side. b, EDS measuring points along the second-order microgrooves, 
peristome. a, A water droplet (about 411; white circle) is dropped from and associated carbon/oxygen (C/O) values. These six values are almost 

a microsyringe and spreads quickly without a significant contact angle, the same as each other, indicating no notable chemical gradient along the 
indicating that the peristome is superhydrophilic. The water spreading microgrooves. 


also shows substantial directionality from the peristome’s inner side to the 
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Extended Data Figure 5 | SEM images of PDMS replicas of the peristome. Like the natural peristome surface, the surfaces of the replicas also possess 
two-order microgrooves (left); moreover, duck-billed microcavities (right) are distributed along the second-order microgrooves. 
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Extended Data Figure 6 | Directional water transport on natural directional water transport. c, d, A hydrophobic replica. Water could 
peristome surfaces and PDMS replicas. a, b, The natural peristome. not be transported from either the inner (c) or the outer (d) side. 
a, Water transport occurred when the water droplet made contact with e, f, A superhydrophilic replica. e, Water was transported from the inner 
the inner side. b, Water could not be transported when it made contact side to the outer side. f, Water could not be transported from the outer side 
with the outer side. Thus, the natural peristome has a capacity for to the inner side. 
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Extended Data Figure 7 | Experimental studies of water rise at various 
wedge corners. a, Capillary rise in a fixed wedge, in which the bottom 
opening angle, a, equals the top opening angle, a. The water rise 
decreases as the opening angle increases. b, Capillary rise in a gradient 
wedge, with a2 = 5°, and a > ay; here, water rise is higher than that in the 
fixed wedge. c, Capillary rise in a wedge with the opposite gradient to that 
in panel b; a2 = 90°, a < a2. Here, the water rise was the lowest of that 
seen in these four experiments. The red triangles in a-c show the height 
of the water rise. d, Capillary rise in a top-closed wedge, with a2=0°. 
The water can rise along the vertical wedge corner to fill the horizontally 


top-closed wedge independently of a). Diagrams showing the water rise 
are at the right of a-d. The heights were limited to 100 mm because of the 
restricted sizes of the available material. e, The experimental water-rise 
results show that the water rise generally decreases with increasing a}. 
Water rise in the gradient wedge (red line) is higher than that in the fixed 
wedge (black line); the top-closed wedge (yellow line) shows additional 
horizontal water transport. Water rise was the lowest in the opposite 
gradient wedge (green line). Error bars indicate standard deviations from 
at least five independent measurements. 
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Extended Data Figure 8 | Volume of rise water and water-retention b, Water-retention time in the four types of wedge. Water-retention time 
time for the four types of wedge corner. a, Volume of rise water in the for the top-closed wedge is the longest, and is about twice that for the 
four types of wedge—fixed wedge, gradient wedge, opposite gradient fixed wedge. Error bars indicate standard deviations from at least five 
wedge, and top-closed wedge. The volume of rise water for the top-closed independent measurements. 


wedge is the largest, and is about 40% greater than that for the fixed wedge. 
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Extended Data Figure 9 | Capillary rise in the gradient wedge, showing the plate and the horizontal plane (3; = 32); h, height of the intersecting 
relevant variables. a, Liquid rise in the gradient wedge corner. b, Section plates; w,», up width of selected small liquid element; wo, down width; 
view of the liquid rise in panel a. a;, bottom opening angle of a wedge; dz, thickness of the small liquid element. 

Q2, top opening angle; g, gravitational constant; 3) and (2, angle between 
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Nineteen -step total synthesis of (+)-phorbol 


Shuhei Kawamura!, Hang Chul, Jakob Felding? & Phil S. Baran! 


Phorbol, the flagship member of the tigliane diterpene family, has 
been known for over 80 years and has attracted attention from many 
chemists and biologists owing to its intriguing chemical structure 
and the medicinal potential of phorbol esters!. Access to useful 
quantities of phorbol and related analogues has relied on isolation 
from natural sources and semisynthesis. Despite efforts spanning 
40 years, chemical synthesis has been unable to compete with 
these strategies, owing to its complexity and unusual placement of 
oxygen atoms. Purely synthetic enantiopure phorbol has remained 
elusive, and biological synthesis has not led to even the simplest 
members of this terpene family. Recently, the chemical syntheses 
of eudesmanes’, germacrenes’, taxanes*® and ingenanes®° have all 
benefited from a strategy inspired by the logic of two-phase terpene 
biosynthesis in which powerful C-C bond constructions and C-H 
bond oxidations go hand in hand. Here we implement a two-phase 
terpene synthesis strategy to achieve enantiospecific total synthesis 
of (+)-phorbol in only 19 steps from the abundant monoterpene 
(+)-3-carene. The purpose of this synthesis route is not to displace 
isolation or semisynthesis as a means of generating the natural 
product per se, but rather to enable access to analogues containing 
unique placements of oxygen atoms that are otherwise inaccessible. 

Tigliane diterpenes have been viewed as promising leads in many 
different medicinal applications including immunomodulatory’, 
anti-viral!° and anti-cancer!! applications. The most progressed 
compound in this class of natural products is phorbol 12-myristate 
13-acetate, which is currently in phase II clinical trials for treatment 
of acute myeloid leukaemia. Subtle perturbations of their structures 
can have a marked effect on their biological profiles, perhaps as a 
result of differing protein kinase C subtype selectivity”'°. As such, 
phorbol esters and related terpenes have been a focus of natural 
products research’. Our interest in this family stemmed from the 
anti-cancer effects of certain phorbol analogues whose access was 
limited. Numerous eudesmane family members have previously been 
accessed in a concise way via a strategy for the chemical synthesis 
of complex terpenes that follows the underlying logic of biosyn- 
thesis’. This strategy has subsequently been successfully applied to 
germacrenes’, complex taxanes*? and ingenanes®®. In particular, the 
two-phase synthesis of (—)-ingenol (2, Fig. 1) is only 14 steps from 
(+)-3-carene (6) and has enabled access to a variety of analogues that 
were not accessible by any other means®*. Because ingenanes (4) 
and tiglianes (3) are closely related in nature, we hypothesized that 
intermediate 5, which is available in quantities of more than 100g, 
might serve as an ideal starting point for a short route to 1. The exe- 
cution of this plan depended on the invention of a simple solution 
to the challenge of incorporating the C12 and C13 oxygen atoms 
of 1; the difficulty in installing this functionality is well known in the 
field of organic chemistry. So far, only two total syntheses'?"4 and 
two formal syntheses'*'* of 1 have been reported in 40-52 steps (see 
Supplementary Information for a summary)'-'*. In these studies, an 
a-oxygenated enone was prepared from a ketone and subsequently 
cyclopropanated to build in the C13 oxidation via a six-step sequence. 
Many other efforts towards 1 have also been reported!” (for a full 
listing of the 36 papers in this area, see Supplementary Information). 


The route described in refs 17 and 18 is particularly instructive and 
illustrates the inherent challenge of using (+-)-3-carene (6) as a start- 
ing material to (+)-phorbol (1): 38 steps were required to reach a 
phorbol analogue lacking the C11 methyl and C13 oxygen from 6 
(refs 17, 18). (Here we define a reaction step as one in which a sub- 
strate is converted to a product in a single reaction flask (irrespective 
of the number of transformations) without intermediate workup or 
purification.) Our studies, outlined below, culminated in a two-phase, 
19-step synthesis of 1 that solves these problems in a concise way. 
The synthesis commenced with intermediate 5 (Fig. 2), provided 
in large quantities using a previously described route’. Installation of 
the C4 oxygen was accomplished using a Mukaiyama hydration and 
in situ silyl group installation to furnish 7 in 70% yield (gram scale). 
At this juncture, we pursued installation of the C12 oxygen atom, 
which required the selective oxidation of a methylene position in the 
presence of an enone, six tertiary C-H bonds and two other compet- 
ing methylene sites. To choose the proper oxidant for this transfor- 
mation, we analysed the structure computationally (Fig. 3a) and by 


"5 


Tigliane skeleton 


t, 


Ingenane skeleton 


Biosynthetic oxidase phase ill 
Key challenge 


Phorbol (1) 


OH 


e Previously 40-52 steps 
¢ Not enantioselective 


13 steps, 
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8 steps, 
ref. 6 
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(+)-3-carene (6) 


Figure 1 | A two-phase approach to ingenanes and tiglianes enables a 
concise approach to the phorbol structure. 
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Figure 2 | 19-step total synthesis of 1. Reagents and conditions are as 
follows. (7) Mn(acac)2, PhSiH3, O2, EtOH, then TMSOTE, EtsN, CH2Ch, 
0°C, 70%. (8) TFDO, CH2Ch, 0°C, then ZnIz, Mgls, EtO. (9) Mn(acac)., 
PhSiH3, O2, PPh3, EtOH, 2 steps (34% 10 + 62% 7). (10) RuCls, NaBrO3, 
NaHCO, EtOAc, CH3CN, H20, 96%. (11) TFAA, DMAP, CH2Ch, 0°C, 
then Zn, ACOH, CH2Clz, then Ac.O, DMAP, CH2Clhy, 0°C, then Et3N, 
DME, 60°C, 64%. (12) TSNHNH), MeOH, reflux, then NaBH3;CN, AcOH, 


inference of innate reactivity using NMR (Fig. 3b). Thus, we predicted 
the pseudo-equatorial C-H bond at C12 to be the most reactive on 
the basis of the following considerations": (1) steric shielding of the 
C6, C7, C8 and C11 positions would decrease their rate of oxidation; 
(2) the higher s-character of the tertiary cyclopropane C-H bonds 
(C13/14) makes them very difficult to oxidize; (3) of the remaining 
carbon centres, '3C NMR indicated that C12 is the most ‘nucleop- 
hilic’; (4) hyperconjugation from the 7-like cyclopropane system 
should facilitate oxidation of the pseudo-equatorial C-H bond on 
C12; and (5) strain-release might, to a small extent, accelerate such an 
oxidation’. Therefore, we chose the small, reactive, electrophilic 
oxidant methyl(trifluoromethyl)dioxirane (TFDO), owing to its 
straightforward preparation and success in other challenging methyl- 
ene oxidations*!. As predicted, TFDO cleanly achieved C-H oxidation 
at C12 to deliver the intermediate cyclopropyl carbinol intermediate 8 
(single diastereomer), which could be isolated and fully characterized 
on a gram scale. In practice, however, it was directly treated in the 
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reflux, 40%. (13) CrOs3, 3,5-DMP, CH2Ch, 0°C to room temperature, 
46%. (14) TMSN3, DCE, 70°C, then l, DCE, pyridine, 70°C. (15) Me4Sn, 
PdCl,(PhCN)2, AsPh3, Cul, NMP, 80°C, 2 steps (64% 18 + 33% 17). 

(16) HE - pyridine, THE, 0°C. (17) Martin Sulfurane, DCE, 60°C, then 
SeO», benzene, 80°C, 2 steps 51%. (18) NaBHy, MeOH, —40°C, then 
Ac,0, DMAP, CH2Ch, 93%. (19) NaBH(OAc)3, benzene, reflux, then 
TBAB, THE, 0°C, then Ba(OH),, MeOH, 72%. 


same flask with Mgl,/Zn1, to elicit a dehydrative ring opening” to 
furnish diene 9 along with unreacted enone 7. We did not observe the 
anticipated tertiary iodide product, presumably owing to rapid elim- 
ination to form a diene system. The mixture of 7 and 9 was directly 
subjected to another Mukaiyama hydration to furnish the tertiary 
carbinol 10 in 34% isolated yield from 7 along with 62% recovered 7. 
The addition of PPh; to this type of reaction has not previously been 
reported and was found to be essential to prevent the reactive inter- 
mediate peroxide adduct from generating an unwanted by-product 
(see Supplementary Information for details). 

With the cyclopropane ruptured, the next step was to install the 
C12/13 oxygenation pattern along with reclosure of the cyclopropane. 
This manoeuvre has no precedent, and needed to be accomplished 
rapidly and in a scalable fashion. To this end, chemoselective oxida- 
tion of the C12/13 olefin to diketone 11 took place in near quantitative 
yield using Ru/NaBrOs. To reform the gem-dimethylcyclopropane 
moiety and access the correct oxidation state, a two-electron reduction 
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Figure 3 | Predicting the site of C-H functionalization on intermediate 
7. a, Structural analysis of intermediate 7 calculated using MacroModel 
10.8 (Schrédinger LLC; http://www.schrodinger.com/citations/41/11/1/). 
Grey, carbon; white, hydrogen; red, oxygen; orange, silicon. b, '*}C NMR 
can be used to predict the most nucleophilic methylene C-H bond after 
taking into account the role of other factors (see text). 


of the diketone to an enediol would need to take place followed by 
intramolecular cyclization. Whereas attempts at acetylation of the 
hindered tertiary alcohol were fruitless and mesylation led to rapid 
elimination (as with the putative iodide intermediate in 8 — 9), treat- 
ment with trifluoroacetic anhydride smoothly led to an intermediate 
trifluoroacetate. This ester could be treated in the same flask with 
Zn to deliver an intermediate (12) whose structure was confirmed 
with the aid of X-ray crystallography. Although the desired reduction 
took place, it was accompanied by the formation of an additional, 
unwanted and bizarre C-C bond between C13 and the carbonyl 
group of the trifluoroester. Although aldol additions of enediols to 
aldehydes” and ketones” are known, there is no precedent for such 
reactivity with esters. We reasoned that this product originated from 
a highly reactive enediol intermediate (such as A; R= H) that might 
be regenerated under equilibrating conditions. In support of this 
hypothesis was the empirical observation that conversion of 11 to 12 
initially resulted in the formation of an approximately 1:1 mixture 
of 12 along with an isomeric species tentatively assigned as the dias- 
tereomeric aldol adduct. Over 24h this mixture equilibrated to 12 
exclusively, perhaps owing to stabilizing intramolecular hydrogen 
bonding interactions. We evaluated various sets of conditions with 
the aim of reintroducing the enediol (retro aldol) in the hope that 
concomitant irreversible cyclopropane formation (via attack at C15) 
would take place. However, treatment of 12 with catalytic Sc(OTf)3 
in DMF led instead to the cyclobutanone 13. Although clearly unde- 
sired, this was encouraging because 13 must have been derived from 
the desired hydroxy-cyclopropane B via a 1,2-shift, which in turn 
was produced via the enediol intermediate A (R =H). Indeed, the 
formation of cyclobutanones such as 13 from hydroxycyclopropanes 
has a precedent”. To circumvent this undesired pathway, interme- 
diate 12 was converted to hemiorthoester 14 by simple treatment 
with Ac,O followed by exposure to Et3N and gentle heating to 
60°C to deliver 15. The single flask conversion of 11 to 15 can be 
conducted on a gram scale, and presumably succeeds owing to the 
in situ formation of intermediate A (R= Ac), which prevents 1,2-shift 
after cyclopropanation. We believe that the conversion of A to the 
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desired cyclopropane is concerted because no elimination products 
were observed (see above). This reaction cascade consists of thirteen 
discreet events occurring in the same flask, forming two new C-C 
bonds, three new ring systems and three C-O bonds, along with the 
cleavage of one C-C bond, two ring systems and two C-O bonds. It 
unravels a complex polycyclic network to a kinetic endpoint using 
simple reagents thereby overcoming the greatest challenge posed by 1. 

The final eight steps of the synthesis served to install two addi- 
tional oxygen atoms, one methyl group and the proper C12 and C10 
stereochemistry. Attempts to install the key C10 stereocentre by way 
of conjugate reduction were met with failure despite extensive exper- 
imentation (see Supplementary Information for details). Thus, to 
achieve allylic transposition of 15, the weakest point of the synthesis 
from an efficiency standpoint, we used TSNHNH) under reductive 
conditions, which led to 16 in 40% yield. Allylic oxidation with CrO3 
(ref. 26) delivered enone 17 in 46% yield. A two-step methyl group 
introduction was accomplished by formation of an a-iodoenone 
(TMSN3, I5)?” followed by a Stille coupling”, which afforded 18 in 
64% overall yield along with 33% recovered 17. Selective removal of 
the C7 and C9 silyl groups with the HF - pyridine complex led to a 
diol that could be selectively dehydrated using the Martin Sulfurane 
reagent to an unstable olefin that was directly oxidized using SeO, to 
aldehyde 19 in 51% overall yield. The selective cleavage of the C7 and 
C9 silyl groups could stem from the inductive effect of the C3 carbonyl 
group, which could make the C4 silyl group less reactive to acids. X-ray 
crystallographic analysis of 19 confirmed the structural assignments 
made thus far. To complete the synthesis, 19 was reduced to the allylic 
alcohol and shielded as the acetate ester. Attempts to directly reduce 
the C20 aldehyde and the C12 ketone in 19 were unsuccessful, owing 
to the rapid rate of the C3 enone carbonyl reduction in the presence of 
the C20 free primary alcohol (the C20 hydroxy group is actually prox- 
imate to C3 as judged by molecular models). Stereoselective ketone 
reduction using NaBH(OAc); followed by successive treatment with 
fluoride and hydroxide sources (TBAF and Ba(OH)p, respectively) 
led to optically pure (+)-1 (synthetic, [a]*°p 105.37 (c 0.23, CH3OH); 
natural, [a]?’p 104.25 (c 0.20, CH3OH)) in 72% isolated yield (15mg 
prepared by this route). 

The concise route to 1 is enabled by a fundamentally different ret- 
rosynthetic approach to terpene synthesis in the laboratory, rather 
than by focusing on the invention of a new synthetic method; the 
newest method used in this synthesis is a Stille coupling, invented in 
the 1980’s”®. Although the development of new synthetic methodolo- 
gies are necessary to push the field of organic chemistry forward, this 
work emphasizes the equal importance of strategy design in the total 
synthesis of complex natural products”’. By holistically mimicking 
the biosynthesis of ingenol (2) in the laboratory®’, we exploited the 
interrelationship between ingenanes and tiglianes by using the same 
building block for both synthetic routes. Because two-phase terpene 
synthesis builds the carbon skeleton with a strategically planned redox 
state, the majority of steps can focus on installing key oxidations 
rather than repositioning functional groups and C-C bonds. Access 
to phorbol (1) requires only five additional steps compared to ingenol 
(2), owing to the presence of two additional oxygen atoms (C12/13) 
placed at particularly challenging locations on the carbon skeleton. 
Successful installation of these oxygenations and two others directly 
or indirectly relied on the application of C-H functionalization logic 
to pinpoint both the location and sequence of reactions!?”°. This arti- 
ficial oxidase phase will enable the synthesis of analogues containing 
deep-seated modifications in the same fashion that has recently been 
reported for ingenol (2)*. Although the route, in its current concise 
form, cannot compete with isolation for the procurement of large 
quantities of phorbol (1), we argue that it could be adapted to become 
truly scalable if the need arose. Rather, the purpose of this synthesis is 
to enable rapid access to new tigliane family members with exciting 
bioactivity that are either difficult or impossible to access through 
isolation or semisynthesis. 
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Northern Hemisphere hydroclimate variability over 


the past twelve centuries 


Fredrik Charpentier Ljungqvist!*, Paul J. Krusic*°, Hanna S. Sundqvist*°, Eduardo Zorita®, Gudrun Brattstrém*” & David Frank® 


Accurate modelling and prediction of the local to continental-scale 
hydroclimate response to global warming is essential given the 
strong impact of hydroclimate on ecosystem functioning, crop yields, 
water resources, and economic security!*, However, uncertainty 
in hydroclimate projections remains large*~’, in part due to the 
short length of instrumental measurements available with which 
to assess climate models. Here we present a spatial reconstruction 
of hydroclimate variability over the past twelve centuries across 
the Northern Hemisphere derived from a network of 196 at least 
millennium-long proxy records. We use this reconstruction to 
place recent hydrological changes*? and future precipitation 
scenarios”!!! in a long-term context of spatially resolved and 
temporally persistent hydroclimate patterns. We find a larger 
percentage of land area with relatively wetter conditions in the ninth 
to eleventh and the twentieth centuries, whereas drier conditions are 
more widespread between the twelfth and nineteenth centuries. Our 
reconstruction reveals that prominent seesaw patterns of alternating 
moisture regimes observed in instrumental data'?-™ across the 
Mediterranean, western USA, and China have operated consistently 
over the past twelve centuries. Using an updated compilation of 128 
temperature proxy records", we assess the relationship between the 
reconstructed centennial-scale Northern Hemisphere hydroclimate 
and temperature variability. Even though dry and wet conditions 
occurred over extensive areas under both warm and cold climate 
regimes, a statistically significant co-variability of hydroclimate 
and temperature is evident for particular regions. We compare the 
reconstructed hydroclimate anomalies with coupled atmosphere- 
ocean general circulation model simulations and find reasonable 
agreement during pre-industrial times. However, the intensification 
of the twentieth-century-mean hydroclimate anomalies in the 
simulations, as compared to previous centuries, is not supported 
by our new multi-proxy reconstruction. This finding suggests that 
much work remains before we can model hydroclimate variability 
accurately, and highlights the importance of using palaeoclimate 
data to place recent and predicted hydroclimate changes in a 
millennium-long context!®!’, 

Global warming is expected to strongly influence the hydrological 
cycle**!8°. Model simulations and theory suggest an intensification 
of the wet-dry contrast at lower latitudes” and a widespread expan- 
sion of dry areas over land*. These projections have yet to be robustly 
detected in recent instrumental observations®?. Furthermore, regional 
changes in precipitation patterns and drought intensity, in response to 
global warming, are challenging to predict owing to the chaotic nature 
of hydroclimate variability, the low spatial coherence of precipitation, 
and fine-scale processes related to cloud formation and precipitation, 
resulting in the large spread among model simulations®’. From proxy 
evidence, we know that hydroclimate anomalies of larger amplitude 
and longer duration than found in modern instrumental observa- 
tions have occurred in the past millennium: for example, the medieval 


‘megadroughts’ in the western USA”, and the monsoon failures in east 
Asia”! in the fifteenth-nineteenth centuries. This strongly suggests that 
the instrumental period is too short to capture the full range of natu- 
ral hydroclimate variability. In this light, a comprehensive aggregation 
and assessment of millennium-long moisture-sensitive palaeoclimate 
proxy data are needed to improve our knowledge of past hydroclimate 
variability, provide a test bed for model simulations, and facilitate the 
detection of anthropogenic impacts on the hydrological cycle’®. 

Here, we analyse the spatio-temporal patterns of hydrological anom- 
alies in an unprecedentedly large network of 196 hydroclimate proxy 
records from Northern Hemisphere land areas over the past twelve 
centuries. All are previously published as indicators of precipitation, 
drought, moisture balance, stream flow, lake level change, bog surface 
wetness or similar hydrological information derived from archives such 
as ice cores, marine sediments, lake sediments, speleothems, tree rings, 
and historical documentary data (Fig. 1a; Supplementary Table 1). All 
records span at least the past millennium, have at least two observations 
per century, and passed a screening procedure for geochronological dat- 
ing control (Methods). Using the same selection and screening criteria, 
we updated and improved the temperature proxy network from ref. 15 
(now 128 records), to investigate the co-variability between hydro- 
climate and temperature (Fig. 1b; Supplementary Table 2). 

To develop a spatial reconstruction we standardized each proxy 
record from its original units (for example, §80, ring width, millimetres 
of precipitation) to an index, ranging from —2 to +2 standard devia- 
tions, of centennial variation (Methods). Anomalies exceeding —2 or +2 
are considered outliers and truncated in order to minimize the influ- 
ence of a few ‘extreme values. This index is interpreted as a measure 
of relative water availability, analogous to the Palmer Drought Severity 
Index (PDSI)”°”!, which permits comparisons of relative variations in 
moisture conditions between arid and moist regions. 

This standardization is of particular importance given the different 
and potentially biased high- to low-frequency signal ratio in the various 
proxies. It essentially transforms the centennial-scale information in 
each proxy record to the same amplitude regardless of possible spectral 
biases (Methods). To extract a common spatial signal from the hetero- 
geneous proxy data, the indices are spatially averaged over an estimated 
correlation decay length (latitudinally dependent) of centennial-scale 
hydroclimate and temperature variability, respectively, derived from 
model simulations and instrumental data (Methods; Extended Data 
Fig. 1). Unlike field reconstruction methods that rely upon teleconnec- 
tions found in the instrumental period”, our technique yields a grid of 
hydrological anomalies that is independent of any spatial co-variability 
found in gridded instrumental observations’». 

Our results reveal a general tendency towards a larger percentage of 
land area with relatively wetter conditions during the ninth-eleventh 
and twentieth centuries, whereas the twelfth-nineteenth centuries 
show larger areas with relatively drier conditions (Fig. 2). Centennial 
periods of reconstructed widespread dry conditions occurred under 


1Department of History, Stockholm University, SE-10691 Stockholm, Sweden. @Centre for Medieval Studies, Stockholm University, SE-10691 Stockholm, Sweden. Bolin Centre for Climate 
Research, Stockholm University, SE-10691 Stockholm, Sweden. “Navarino Environmental Observatory, GR-24001 Messinia, Greece. Department of Physical Geography, Stockholm University, 
SE-10691 Stockholm, Sweden. ®Helmholtz-Zentrum Geesthacht, Institute for Coastal Research, DE-21502 Geesthacht, Germany. Department of Mathematics, Stockholm University, SE-10691 
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Figure 1 | Location map of the proxy records used in this study sorted 
by archive. a, Locations of the 196 hydrological proxy records for the past 
twelve centuries. b, Locations of the 128 temperature proxy records for the 


both warm and cold climate states (Fig. 3). Notably, the dry twelfth 
century (with 27% reconstructed hydroclimate anomalies <—0.5) is 
associated with positive forcing anomalies from high solar activity and 
moderate volcanic activity, whereas the equally dry fifteenth century 
(also with 27% reconstructed hydroclimate anomalies <—0.5) coin- 
cides with negative forcing anomalies from low solar forcing and 
several large volcanic eruptions”*. We identify inverse hydroclimate 
conditions (that is, hydroclimate dipoles) over geographically adjacent 
regions that persist on centennial timescales. This phenomenon has 
been reported in studies of instrumental data and some proxy time 
series for the same regions: the eastern and western Mediterranean!”; 
the southwestern and northwestern USA'%; eastern China and the 
Tibetan plateau’ (Fig. 2). 

To assess the robustness of our gridded reconstruction we con- 
ducted a number of validation tests. Recognizing the potential for low- 
resolution, less well dated, proxies to introduce temporal biases in the 
reconstruction, we produced a decadally resolved reconstruction from 
an even more strictly screened subset of proxies (Methods). At the grid 
point level the median correlation between the full reconstruction and 
the decadally resolved reconstruction is r= 0.97 (p < 0.01). Additional 
sensitivity tests, for example, excluding one proxy type at a time, exclud- 
ing or retaining records according to their seasonal signal, or their 
resolution (Methods; Extended Data Table 1), also reaffirm the robust- 
ness of the full reconstruction. Moreover, our reconstruction shows 
a relatively good correlation with tree-ring-derived reconstructions 
of PDSI, despite the differences between metrics and reconstruction 
seasonality (Methods; Extended Data Table 1)?°?!. We also compared 
our decadal reconstruction to decadal means of gridded instrumental 
precipitation data over the twentieth century. The median correlation 
is r=0.66 (p <0.05), but the short length of the instrumental data 
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o Lake sediments 
e Marine sediments 


180° 


past twelve centuries. All the records cover at least the period 1000-1899. 
We note that some proxies share the same geographical position but are 
based on different types of archives. 


precludes a full validation of the low-frequency behaviour in our hydro- 
climate reconstruction (Methods; Extended Data Figs 2 and 3). 

In regions with sufficient overlap between the hydroclimate and tem- 
perature reconstructions, we observe a tendency for wetter conditions 
during warmer periods in northern Europe, the eastern Mediterranean, 
Greenland, northern China, northwestern North America and the 
southeastern USA (Extended Data Fig. 4c, d). Drier conditions dur- 
ing warmer periods are seen in the central Mediterranean, central and 
northeastern North America, southwestern USA, and southern China. 
For certain regions (for example, northern China, northwestern North 
America, northern Europe and Greenland) the patterns of covariance 
between reconstructed temperature and hydroclimate are qualitatively 
consistent with the predicted signature of hydroclimate changes under 
future global warming”! 

With our new spatial hydroclimate reconstruction we evaluated the 
centennial distributions of gridded hydrological and temperature anom- 
alies (Fig. 3) against an identical extraction of precipitation and temper- 
ature anomalies from six state-of-the-art coupled atmosphere-ocean 
general circulation model simulations covering the period 850-2005 
(Table 1; Fig. 2c; Extended Data Figs 4a, b and 5). Both the proxy and 
model data agree that the thirteenth century had the most extensive 
dry conditions in the pre-industrial period. We observe that the seven- 
teenth century, the coldest Northern Hemisphere century of the past 
twelve centuries in both proxy and model data (Extended Data Fig. 
6a, b)'*!”, shows hydrological conditions close to the long-term mean, 
with a lack of extremes (only 6% reconstructed hydroclimate anomalies 
exceeding +£0.5). 

Overall, large-scale agreement in the temporal evolution between 
proxy and model anomaly distribution characteristics is observed 
from the tenth to the eighteenth centuries for both hydroclimate and 


Table 1 | Models used in this study from CMIP5, with simulations from 850 to 2005 


Atmospheric resolution 


Abbreviated model name Model institute (longitudinal x latitudinal) Forcing 
bec-csm1-1 Beijing Climate Center, China 2.81°x2.81° ref. 23 
CCSM4 National Center for Atmospheric Research, US DOE/NSF, USA 1.25° x 0.9° ref. 23 
GISS-E2R NASA Goddard Institute for Space Studies, USA 2.5° % 2° ref. 23 
HadCM3 Hadley Center, UK Met Office, UK 3.75°'x 1.875° ref. 23 
IPSL-CM5A-LR Institute Pierre Simon Laplace, France 3.75° x 1.89° ref. 23 
MPI-ESM-P Max Planck Institute for Meteorology, Hamburg, Germany 1.875° x 1.875° ref. 23 


CMIP5, Climate Model Intercomparison project Phase 5. For references to the individual models and additional information, see Methods. 
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Figure 2 | Spatial-temporal distribution of gridded centennial 
hydrological proxy anomalies. a, Gridded, weighted, centennial 
hydroclimate proxy anomalies, derived from the proxy data shown in 
Fig. 1a, over land areas with at least three independent proxies within 
the estimated centennial correlation decay length for centennial-scale 
hydrological variability. b, Time series derived from the reconstructed 
gridded weighted hydroclimate anomalies, showing the fraction of land 


temperature (Fig. 3). However, the similarities in reconstructed and sim- 
ulated hydroclimate begin to break down in the nineteenth century, with 
outstanding disagreement of the centennial distribution characteristics 
in the twentieth century. The proxy data continue to show hydrocli- 
mate characteristics in the twentieth century similar to those in the pre- 
industrial era, whereas the models simulate a high percentage (36.8%) 
of the land areas to be in either an extremely dry (21.4% of anom- 
alies <—1.5) or extremely wet (15.4% of anomalies >1.5) state. In 
contrast to the simulations, the proxy data show more extreme hydro- 
climate anomalies during centuries other than the twentieth (Figs 2 
and 3). The unprecedented intensification of wet and dry anomalies in 
the model simulations, presumably driven by anthropogenic forcing, 
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area exceeding a given wetness or dryness threshold. The red horizontal 
bars denote the 50% levels. c, The same as b but for median-model 
anomaly values of annual precipitation. All anomalies are shown relative 
to the centennial mean and standard deviation over the eleventh to the 
nineteenth centuries. The colour scale is truncated at —2 and 2 and areas 
with insufficient proxy coverage to compute a gridded weighted mean 
value are left white. 


results in an exceptionally strong meridional gradient producing a 
wetter north and drier south (Fig. 2; Extended Data Fig. 7a-f) not 
found in the proxy data. This produces a distinct bimodal distribution 
of simulated hydroclimate (that is, wet and dry extremes), whereas the 
proxy data reveal a unimodal distribution both in the industrial and 
pre-industrial period (Fig. 3). Assessment of interdecadal hydrocli- 
mate trends and their variances, in proxy and model data, shows that 
the differences in the twentieth century between proxy and model 
data are mainly restricted to the centennial-scale mean anomaly. The 
model and proxy data agree that neither the interdecadal variability 
nor the decadal trends in the twentieth century are unprecedented 
(Supplementary Tables 5 and 6). 
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Figure 3 | Probability density function of reconstructed and 
simulated Northern Hemisphere hydroclimate and temperature 
anomalies. Reconstructed weighted-smoothed gridded proxy (left) 

and median-model (right) centennial anomaly values of Northern 
Hemisphere hydroclimate (top) and temperature (bottom), showing the 
percentages of Northern Hemisphere land area within defined threshold 
values ranging from —2 to +2 in increments of 0.5 standard normal 
deviates. Superimposed on each distribution are the corresponding 
centennial anomalies of solar, volcanic, and CO) forcing time-series 
(Methods). 
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A persistent link has been suggested between Northern Hemisphere 
thermal conditions and the displacement of the Intertropical 
Convergence Zone (ITCZ)™*. During warmer periods the ITCZ moves 
northward, changing the position of Northern Hemisphere storm 
tracks, monsoon activity and precipitation patterns”’. A hypothesized 
mechanism for these changes is an expansion of the Hadley circu- 
lation, bringing more moisture and latent energy northward. When 
we assess the reconstructed hydroclimate patterns during the recon- 
structed warmest (tenth) and coldest (seventeenth) pre-industrial 
centuries in the Northern Hemisphere, we find evidence for strong 
and weak, respectively, meridional gradients supporting the proposed 
dynamics. In our reconstruction, the twentieth-century manifesta- 
tion of these dynamics cannot be seen to the extent reconstructed in 
the tenth century or in the twentieth century in model simulations 
(Fig. 2b, c; Extended Data Fig. 7). These findings could corroborate 
the suggestion by ref. 24 that ITCZ migration, owing to anthropogenic 
greenhouse gas forced warming and possibly tropospheric aerosols, 
may produce a hydroclimate regime that differs from past regimes. 
An alternative explanation is that the spatial signature of hydroclimate 
can be highly variable under comparable thermal conditions, making 
it difficult to attribute a particular external forcing to the observed 
patterns. 

There are several possible explanations for the limited ability of 
climate models to simulate hydroclimate variability accurately: knowl- 
edge gaps in describing physical processes defining model sub-grid 
parameterizations®”®, systematic absolute temperature biases in the 
models affecting evaporation and precipitation processes”’, and an 
underestimation of internal hydroclimate variability**. We find only 
limited regional-scale reconstruction—model agreement (Fig. 2a—c and 
Extended Data Fig. 4a, b), as would be expected if Northern Hemisphere 
hydrological variability is driven more by internal dynamics 
than external forcing. 

Our comparison of the distribution characteristics and large-scale 
temporal evolution of reconstructed and simulated centennial-scale 
hydroclimate reveals strong similarities up to the nineteenth century. 
However, the proxy evidence does not support the tendency in sim- 
ulations for wet regions to become wetter and dry regions drier in a 
warmer climate’”*". Furthermore, our hydroclimate reconstruction 
does not support a general unprecedented intensification of the 
hydrological cycle in the twentieth century, associated with both more 
extreme wet and dry conditions, as simulated by an ensemble of models. 
This finding is in line with recent analyses of instrumental data reporting 
limited evidence for an intensification of wet and dry anomalies under 
current global warming®””®. Absence of evidence for this predicted 
hydroclimate response to higher temperatures suggests that either the 
warming in the twentieth century has not been strong enough for this 
fingerprint to emerge, or that the triggers for the mechanisms responsible 
should be revised. We conclude that a millennium-long perspective, 
as provided by multi-proxy evidence, is crucial to our ability to bench- 
mark and evaluate’®!” hydroclimate simulations accurately. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


The proxy data collection. We systematically searched the scientific literature for 
proxy records of hydroclimate and temperature. We retrieved all records fulfilling 
the following criteria: 

(1) Temporal coverage. All proxy records must cover at least the period 1000-1899. 
(2) Temporal resolution. All proxies must have at least, on average, two data points 
per century between 1000 and 1899 and no more than two centuries over the 
1000-1899 period may have fewer. In the data subset containing only hydroclimate 
records resolved decadally or better, we required that all proxies have, on average, 
one data point per century between 1000 and 1899. 

(3) Publication requirements. We use only proxy records that have been published 
in the scientific literature. 

(4) Requirements regarding the climate signal. The proxies must, according to their 
published description, contain a hydrological signal described as precipitation, 
drought, moisture balance, stream flow, lake level change, bog surface wetness, 
flooding or similar hydrological information. 

(5) Geographical requirements. All the proxies must be located in, or nearly in, the 
Northern Hemisphere. Proxies just south of the Equator (by a few degrees) may be 
included if they contribute to gridded values north of the Equator. 

The majority of records were obtained from public repositories (for example, 
http://www.ncdc.noaa.gov/paleo/ and http://www.pangaea.de/). Those records not 
in the public domain were acquired either by direct request from their authors, 
or digitized from figures in the original articles (Supplementary Tables 1 and 2). 
The proxy data were divided into seven categories: (1) ice cores, (2) peat cores, 
(3) tree rings, (4) speleothems, (5) documentary, (6) lake sediments, and (7) marine 
sediments. In cases where there exist several versions of a proxy record from the 
same site, preference was given to the latest published version. The collected 
proxy records were divided into three simplified categories of seasonal response: 
annual, winter and summer, with spring and early autumn considered as part of the 
summer season. In cases when no clear information about a proxy’s seasonality 
could be found, either in the original article or in subsequent publications, we 
assumed the seasonality to be annual. 

Geochronological screening. We only included proxy records with age models 
constrained by at least one dating point after 1900 (D1), another between 1000 and 
1900 (D2) and one dating point before 1000 (D3). The number of years between 
D1 and D2 and D2 and D3 should be less than 1,000 years. 

The dating criteria are: 

(1) The top of the core exists (that is, year of collection) or, if it does not, there must 
be a dating point (for example, '*’Cs, 7”°Pb) to verify the age (A1 is the top age). 
(2) There should be at least one dating point (‘4C, U/Th, tephra, 7!°Pb) between 
800 and 1999 (A2). 

(3) There should be at least one dating point between 800 and 1 (A3). 

(4) The spacing between these three dating points should be less than 800 years 
(Al — A2 < 800 years, A2 — A3 < 800 years). 

(5) Al, A2 and A3 cannot be bulk datings. 

(6) The acceptable dating errors for Al, A2 and A3 must be <200 years (20). 

For the decadal- or better-resolved subset, each record must have at least one dating 
point per full century. 

Data treatment. Prior to computing centennial anomalies, all proxy records 
with irregularly spaced time steps were converted into time series with annually 
spaced time steps using simple linear interpolation, then smoothed by a cubic 
smoothing spline having a 50% frequency response at 100 years. The smoothed 
proxy series are transformed to standard normal deviates with respect to the 
1000-1899 baseline period. From the resulting centennial anomalies every 25th 
value is extracted from each century with at least 85 (interpolated) values. From 
those proxy records spanning the complete 800-1999 period, we obtain 45 
centennial anomalies. Of these 45 centennial anomalies the 12 representing the 
hundred-year periods 800-899, 900-999, 1000-1099, 1100-1199, ..., 1900-1999, 
are those presented in the majority of figures throughout this Letter (Extended 
Data Fig. 8b). However, all 45 centennial anomalies are used in the correlation 
experiments as well as in Fig. 2b and c. For the subset including only hydroclimate 
records decadally resolved or better, we obtained decadal anomalies using the 
same method as described above but instead using a spline with a 50% frequency 
response at 10 years. 

Correlation decay length. Because individual proxy records can contain noise, 
amore robust signal is presumably obtained by producing an average of multiple 
proxies from within a geographic area where hydroclimate can be assumed 
to be coherent. Unfortunately, the size of such geographic areas, on centennial 
timescales, is not well constrained owing to the limited length and distribution of 
instrumental observations. This deficiency required the development of a measure 
of correlation decay length (CDL)*! of hydroclimate*** on centennial timescales. 
Encouraged by the agreement found between instrumental precipitation data on 
annual and decadal timescales*4 and simulated values from the ECHO-G model?° 
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on the same timescales, we used the simulated centennial values from the ECHO-G 
model to calculate a conservative estimate of the spatial auto-correlation function 
for hydroclimate on centennial timescales by the following equation: 


1 


CH= Df fle bsifl kd) (1) 


where t is the time index, i and j are the longitude index and the latitude index of 
the grid, respectively; f(t, i, j) are the deviations from the local long-term mean at 
time t and at a grid-cell (i, j) divided by the local temporal standard deviation (that 
is, standardized to one standard deviation), d is the spatial separation between grid 
cells (i, j) and (k, 1). The sum of the product f(t, i, j)f(t, k, J) over time and space 
is restricted to the grid cells (k, /) that are separated from the grid cell (i, ]) by the 
distance between L— AL and L+ AL, where AL is 100km. This sum is divided 
by the number of occurrences of grid-cell pairs. The outer sum is calculated over 
all grid cells located on an index circle j, and then this sum is divided by that 
number of grid cells. It is thus assumed that the spatial autocorrelation is isotropic 
and zonally homogeneous, that is, it depends only on length L and on latitude j. 
The assumption of isotropy and zonal homogeneity is an assumption that is not 
completely fulfilled for all terrestrial locations. However, this approach yields an 
approximate and average estimation of the decorrelation length at multi-decadal 
to centennial timescales, and this estimation amounts to a few hundred kilometres 
(Extended Data Fig. 1a). The precise value of the decorrelation length does not 
affect the basic features of the reconstruction. A simplified CDL function of hydro- 
climate variability on centennial timescales was derived from the output of the 
climate model ECHO-G* over the period 1000-1990 to calculate the maximum 
search distances as a function of latitude in this study (Extended Data Fig. 1b). 
Averaging, weighting and gridding. We use the distance function described 
above, derived from the estimated centennial hydroclimate CDL, to define the 
search radius for averaging proxy anomalies (Extended Data Fig. 1a and b). Finding 
the decorrelation of hydroclimate increases with distance from a search centre, 
we assign a weight for each proxy record, for any given latitude, decreasing from 
1 at the centre to e * ~ 0.14 at the search periphery using the following Gaussian 
weight function: 


we 28?/R (2) 


Here, w is the weight assigned to a proxy value located at x distance from a grid 
node, and R is the search radius defined by equation (1). Every reconstructed grid 
cell location is required to have three or more proxy centennial anomalies within the 
search radius. This requirement effectively reduces the total number of proxies used 
in the experiment, although some regions remain densely covered (such as North 
America, China and Europe; Extended Data Fig. 8a). Weighted means are computed 
for all 45 centennial anomalies and are those used in the correlation analyses. All 
gridding and weighting of proxy (and model) data are performed using the method 
described above for producing the proxy local weighted averages, although the 
search area centres are moved to each intersection of a 1° x 1° longitude by latitude 
grid superimposed over a polar projection of the Northern Hemisphere. Maps of 
centennial gridded proxy hydroclimate variability are shown in Fig. 2a, and time 
series corresponding to the fraction of land area at or above a given wetness or dry- 
ness threshold are shown in Fig. 2b. The weighted and gridded proxy temperature 
reconstruction shown in Extended Data Fig. 6a and used in Fig. 3 is produced in the 
same manner as the hydroclimate reconstructions, with the exception of the CDL 
function used for weighting and gridding. Detailed descriptions of the CDL, the 
proxy weighting function, and the search distance function used for reconstructing 
Northern Hemisphere centennial temperature variations are given in ref. 15. 
Climate model data. We have used simulations from six coupled atmosphere— 
ocean general circulation model simulations that are part of the Climate Model 
Intercomparison project Phase 5 (CMIPS5)**. The simulations denoted as ‘past1000” 
cover the period 850-1850, whereas the ‘historical’ simulations cover the period 
1851-2005. The ‘past1000’ simulations are sometimes performed using a lower 
resolution or simplified versions of the more comprehensive models used for the 
‘historical’ simulations. We have restricted our analysis to those six simulations— 
bec-csm1-1°7, CCSM4”8, IPSL-CM5A-LR”, GISS-E2R”, HadCM3", and MPI- 
ESM-P“’—conducted with the same model versions for both periods and stitch 
the ‘past1000’ and ‘historical’ simulations that carry the same ensemble run- 
initialisation-physics (rip) identifier in the CMIP5 repository (Table 1). However, 
the simulation conducted with the MIROC-ESM model was not used owing to 
a documented, unrealistic, long-term positive temperature trend in global mean 
annual near-surface temperature over the past millennium”. 

The models were driven by estimations of external forcing, including varia- 
tions in the orbital changes, total solar irradiance, volcanism, atmospheric trace 
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gases and land-use changes. The CMIP5 Consortium recommended similar, but 
somewhat different, options for these external forcings’** and the modelling 
groups did not always use the same options for all CMIP5 simulations. From each 
of the six simulations we extracted the annual mean precipitation and temperature 
at the model grid-cell location nearest to our proxy locations. Once the selection 
was made, an equal length extraction was performed to make all comparisons 
between model and proxy as equitable as possible. The smoothing, weighted aver- 
aging and gridding of all model data followed the same protocol as described above 
for the proxy data. The centennial forcing anomalies in Fig. 3 are produced from 
time series of solar*®, volcanic*® and CO, forcing’” the same or similar to those 
used in the models. 

Calculation of correlations. All temporal correlations between two weighted 
and gridded data sets are calculated by Pearson product-moment correlation 
coefficient**, Assessment of the statistical significance of these correlations is com- 
plicated by the fact that any two data sets are both temporally and spatially correlated. 
We have taken temporal correlation, that is, autocorrelation, into account by using 
the method of blocked bootstrap (3,001 repetitions)*” to provide a more realistic 
estimate of the correlation’s statistical significance. In Extended Data Fig. 4a 
we show the correlations between the gridded hydroclimate proxy data and 
corresponding gridded hydroclimate model data and Extended Data Fig. 4b 
shows the statistical significance of the correlations. In Extended Data Fig. 4c we 
show the correlations between the gridded hydroclimate proxy data and gridded 
temperature proxy data and Extended Data Fig. 4d shows the statistical significance 
of the correlations. 

We produce histograms of all cross-correlations in this study (Extended Data 
Fig. 9a-d) and their Fisher z transform*? (Extended Data Fig. 9e—-h) to assess the 
normality of their distributions. A slightly bimodal distribution is seen, for both 
the original and Fisher z-transform correlations, between the gridded hydroclimate 
proxy data and gridded temperature proxy data. This is not what we would expect 
to see if hydrological and temperature anomalies were unrelated: the distribution 
of the Fisher z transform of the correlations should then be approximately normal 
and, in particular, unimodal. Instead, the histograms suggest that the distributions 
of the correlations between proxy hydroclimate and proxy temperature values are a 
mixture of two unimodal distributions, corresponding to two distinct mechanisms 
relating hydroclimate and temperature. 

Validation tests. To assess the robustness of our gridded weighted hydroclimate 
reconstruction we have produced a number of ‘subset’ reconstructions, using 
different data screening schemes, as well as comparing our multi-proxy recon- 
struction to other hydroclimate products. We created a subset of proxies with 
only those records that have decadal to annual resolution and at least one dating 
point per century. The gridded reconstruction obtained from this subset has less 
spatial coverage than the reconstruction from the full data set. The median Pearson 
product-moment correlation coefficient, at grid point level, between the two recon- 
structions (using 45 centennial values, lagged by 25 years) is r= 0.97 (p< 0.01), 
suggesting that any potential dating uncertainties that exist in the full data set have 
a small influence on our overall results (Extended Data Table 1). Furthermore, we 
correlated the full reconstruction with reconstructions produced excluding one of 
the seven proxy types at the time, which notably also involves excluding the numer- 
ous but less well age-constrained lake sediment records (Extended Data Table 1). 

Moreover, we produced similar tests excluding/retaining proxy records reflect- 

ing quantitative estimates of precipitation as well as excluding/retaining proxy 
records with annual signal, summer signal, and winter signal, respectively. Finally, 
we correlated our reconstruction from the full data set as well as the reconstruction 
from the more strictly screened data set resolved decadally or better, set against 
the North American Drought Atlas”® and the Monsoon Asia Drought Atlas”". At 
grid point level the median Pearson's correlation is r= 0.34 (p < 0.15) and r=0.50 
(p < 0.03), respectively. These significant, but not very high, correlation values are 
acceptable considering our hydroclimate reconstruction metric is not an equivalent 
measure of the PDSI°!” and furthermore that the PDSI reconstructions are for 
the summer season only. 
Potential spectral biases in proxy data. The Earth’s climate system contains 
variability from sub-daily to Milankovitch to deep geological timescales!7°3-°°. 
It has been known for a long time that proxy records may record climatic variation 
occurring on different wavelengths (that is, interannual to decadal to centennial to 
millennial) with varying fidelity resulting in disproportionate manifestations of the 
variance across these different wavelengths**. Such ‘spectral biases’ are a type of 
noise that, if not considered or corrected, potentially challenge a precise quantita- 
tive assessment of the continuum of climate variability. Using instrumental records 
to characterize and subsequently correct the possible spectral biases in proxy 
records should yield more accurate reconstructions of climate. However, we note 
that this endeavour to characterize proxy biases is questionable, because the true 
long-term characteristics of the climate system remain broadly unknown, which 
is part of the motivation to develop high-quality proxy data and reconstructions. 


Knowledge of proxy data sets is, however, sufficiently advanced to state that 
individual proxy types may be characterized by one or more biases. For example, 
tree-ring data, arguably the most intensively investigated proxy archive for the 
past millennium in both the temporal and frequency domains, are known to have 
two potentially opposing biases. On the one hand, owing to a ‘biological memory, 
tree-ring width data especially contain somewhat less high-frequency (interannual) 
variability than instrumental temperature (or precipitation) data“. On the other 
hand, the necessary removal of the long-term age/size-related trends in tree-ring 
records, if performed using certain techniques, results in the so-called ‘segment 
length curse’® that limits our ability to preserve trends/variability exceeding 
the length of the mean length of the individual tree-ring sequences. In this case, 
diminished low-frequency (centennial to millennial) variability would be expected. 

There are, however, different techniques that can be applied to overcome or 
mitigate such spectral biases. The removal of the age-related trends using a sin- 
gle population estimate as in the Regional Curve Standardisation technique has 
been shown to break the ‘segment length curse’ and to preserve long wavelengths 
better®* ©. Similarly, the biological memory can be estimated and removed by 
autoregressive modelling and the resulting time series can be assigned the spec- 
tral properties expected from the instrumental data sets°®. Analyses of long-term 
instrumental data sets are limited to the past couple of centuries, which does not, 
as also mentioned in the main text, allow a full validation of the low-frequency 
(for example, centennial to millennial timescale) components of proxy data or 
reconstructions. 

In most proxy records, just as with the tree-ring data, the low-frequency vari- 
ability can be affected by nonlinear processes such as the compression or mixing 
of layers, a time-varying temperature-precipitation relationship, anthropogenic 
impacts on the local environment and, in the case of ice-core data, ice-flow move- 
ments in the ice cap. It should be noted that the interpolation of non-annual records 
to annual resolution also results in a spectral bias towards lower frequencies. 
Given the uncertainties back in time in many individual proxy records, both in the 
high- and low-frequency signal, a multi-proxy approach, using the average signal 
from a number of disparate proxy records, is arguably best suited for the study of 
low-frequency climate variability. 

In multi-proxy reconstructions, where the aim is to obtain both the high 
(annual) and low (centennial) frequency signal, any spectral bias in the proxies, 
influencing the ratio of the high- and low-frequency signal, is a problem of major 
importance. Our aim, however, is only to obtain the signal on centennial and 
longer timescales. By standardizing centennial mean values, proxy records that 
have both strong and weak low-frequency signals are transformed to records of just 
centennial-scale information of the same amplitude. This transformation essentially 
eliminates the influence of spectral biases by making irrelevant whatever prior ratio 
of high- and low-frequency information had existed in the proxy before standard- 
ization. Even if a proxy does not capture the full amplitude of centennial-scale var- 
iability (for example, tree-ring width chronologies built by many short individual 
series and/or limited replication®) the impact is small because the standardization 
process gives all proxies the same amplitude of centennial-scale variability. As long 
as the proxies have some centennial-scale variability (which is the only variability 
we retain for our primary reconstruction) the proxies are useful to us. 

Our different sensitivity tests (Extended Data Table 1), removing various proxy 

types from the complete data set and correlating their resulting reconstruction with 
the full reconstruction, demonstrate a very limited impact of potential spectral 
biases from particular proxy types on our full reconstruction. These experiments 
produce remarkably similar hydroclimate reconstructions, suggesting that the 
purported negative effect of ‘mixing’ proxies with potential spectral biases is not 
large (if not altogether mitigated by the standardization). Especially important 
in this regard are results from those tests in which all the records with less than 
decadal resolution, with only centennial dating control, and those in which all 
the tree-ring records and all the lake sediment records are excluded, respectively. 
Not surprisingly, the reduction in proxy coverage gave rise to some spatially local 
differences; however, the overall patterns remained the same. Most of the changes 
seen are less likely to be attributable to spectral biases in the proxies, but rather to 
a diminishment in the considered number of proxies, and hence greater noise in 
the reconstruction. We mitigate the impact of proxy noise in our reconstruction 
approach by requiring multiple records within the search radius for every grid 
point reconstructed. 
Data and code availability. All the proxy data and programs used to perform the 
experiments herein have been made publicly available at NOAA Paleoclimatology/ 
World Data Center for Paleoclimatology (https://www.ncdc.noaa.gov/paleo/ 
study/19725). The Source Data we used to plot each figure in the Letter are also 
stored there. The suggested computing environment and dependencies are: Mac 
OSX 10.6 or greater, Generic Mapping Tools (GMT) 4.5.6 or later®’, Ghostscript 
9.10, and Matlab2007b or later. A detailed description of the software used, with 
example data and runtime commands is supplied with the archived data. 
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The climate model data included in the simulations of the Climate Model 
Intercomparison Project Version 5 (CMIP5) can be downloaded from any of 
the nodes of the Earth System Grid Federation (registration is required and a 
password is provided after registration), for example from: https://esgf-data.dkrz. 
de/search/cmip5-dkrz/. The menu on the left side of the page allows the user to 
select the ‘experiment’ (in our case, ‘past1000’ and ‘historical’), the model (for 
example, MPI-ESM-P), the variable (pr = precipitation), and the time frequency 
(mon= monthly). A click on the search button displays all matching results. The 
data download can be performed interactively by clicking on the links of the indi- 
vidual files or by downloading a Unix script ‘wget which can be locally run on 
a Unix computer. All files are written in ‘netcdf’ format, which also includes the 
metadata information, and that can be accessed with a variety of software tools, for 
instance with the public statistical packages R, Climate Data Operators (CDO) or 
netcdf Operators (NCO). Data of the ECHO-G simulation can be retrieved from 
the CERA data bank of the German Climate Computing Centre (DKRZ) at http:// 
cera-www.dkrz.de/. An account on the CERA data bank is required to access the 
data and can be obtained by sending an email to data@dkrz.de. CERA will then 
provide a personal account and password as soon as possible. Once an account has 
been created, the data of the ECHO-G simulation can be accessed. 
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Extended Data Figure 1 | Estimated correlation decay length values. 
a, The spatial decorrelation function p for centennial mean precipitation 


estimated from the output of the climate model ECHO-G** over the period 


1000-1990, following the procedure described in the Methods for the 
estimation of the centennial correlation decay length for hydroclimate 
variability. Distance is the correlation decay length from one point in 
kilometres. The different colours represent the latitudinal bands. 


Distance (km.) 


b, An example of the estimated spatial autocorrelation function for 
centennial mean values of precipitation at latitudes 81.25° N, 80.00° N 
and 78.75° N, respectively, with the decorrelation length for latitude 
80° N indicated in red. c, The simplified CDL function of hydroclimate 
variability at centennial timescales, derived from panel a, is used 
throughout the study to calculate the maximum search distances as a 
function of latitude. 
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Extended Data Figure 2 | The fraction of land area, expressed as decadal —_ annual precipitation. c, The same as a and b but for the Global Historical 


means for 1900-1999, exceeding a given wetness or dryness threshold Climatology Network 5° x 5° (GHCNS) instrumental annual precipitation 
in the gridded reconstruction, model simulations, and instrumental data. All decadal average values are standardized over the 1910-1979 
precipitation data. a, Weighted gridded proxy reconstruction derived period, and model and instrumental values are extracted from grid cells 
from the subset containing only hydroclimate records resolved decadally covered by gridded reconstructed values. The red horizontal bars denote 
or better. b, The same as a but for median-model anomaly values of the 50% levels. 
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Extended Data Figure 3 | Boxplots showing decadal anomaly values 
of instrumental data, our gridded reconstruction, drought atlas data, 
and model simulation of precipitation over the 1900s. a, Comparison 
of decadal anomalies between our gridded Northern Hemisphere (NH) 
hydroclimate reconstruction, the gridded 5 x 5° GHCNS instrumental 
annual precipitation anomalies, the six individual model simulations 
(see Table 1) of annual precipitation and their median. b, Comparison 
of decadal anomalies between the Monsoon Asia Drought Atlas 
(MADA)”! and the corresponding domain in our gridded Northern 
Hemisphere hydroclimate reconstruction, the GHCN5 instrumental 


annual precipitation data set, and in the six individual model simulations 
of annual precipitation and their median. c, Comparison of decadal 
anomalies between the North American Drought Atlas (NADA)” and the 
corresponding domain in our gridded Northern Hemisphere hydroclimate 
reconstruction, the GHCNS5 instrumental annual precipitation data 

set, and in the six individual model simulations of annual precipitation 
and their median. The oval circles represent the mean, the small blank 
horizontal bar represents the median, the length of the bars represents the 
quartile range, and the dark grey dots represent the two standard deviation 
intervals, whereas the light grey dots represent outliers. 
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Extended Data Figure 4 | Correlations between gridded proxy and 
model hydroclimate anomalies, and gridded hydroclimate temperature 
proxy anomalies. a, Correlations between 45 centennial, lagged 

25 years, weighted gridded proxy hydroclimate anomalies and their 
corresponding median total annual precipitation anomalies from six 
CMIP5 models, listed in Table 1, over the past twelve centuries. b, The 
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Z-transformed block bootstrap p-values of the correlations shown in 
panel a. c, Correlations between 45 centennial, lagged 25 years, weighted 
gridded proxy hydroclimate anomalies and weighted gridded proxy 
temperature anomalies. d, The Z-transformed block bootstrap p-values 
of the correlations shown in panel c. Areas shown in grey in b and d have 
insignificant correlations. 
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Extended Data Figure 5 | Simulated median values of annual mean and standard deviation over the eleventh-nineteenth centuries. The 
precipitation from six atmosphere-ocean coupled general circulation colour scale in both panels is truncated at —2 and 2. b Gridded, weighted, 
models. a, Raw, centennial, model anomaly median values calculated values for the same data over land areas with at least three independent 
and treated and plotted in the same way as the hydrological proxy data. grid values within the estimated centennial correlation decay length for 
Only values from the same grid cells that are covered by proxy records centennial-scale hydrological variability. 


are extracted (Methods). Anomalies are shown relative to the centennial 
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Extended Data Figure 6 | Centennial temperature proxy anomalies to compute a gridded weighted mean value are left white. b, Gridded, 
updated from ref. 15. a, Gridded, weighted, centennial proxy anomalies weighted, centennial anomalies for simulated median values of annual 
values derived from the data listed in Supplementary Table 2 and shown mean temperature from six atmosphere-ocean coupled general circulation 
in Fig. 1b. Anomalies are shown relative to the centennial mean and models. Only values from the same grid cells that are covered by proxy 
standard deviation over the eleventh—-nineteenth centuries. The colour records are extracted (Methods). 
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Extended Data Figure 7 | Gradients of proxy-reconstructed and 
simulated Northern Hemisphere centennial hydroclimate anomalies 
along three meridional transects for the tenth, twentieth and 
seventeenth centuries. The tenth and twentieth centuries were the 
warmest centuries of the past twelve and the seventeenth century was 
the coldest. a, Smoothed, surfaced, and contoured weighted average 
centennial proxy anomalies for the tenth century (top right). The trend 
of the smoothed surfaced anomaly values, with their regression line, 


is shown along the meridional transects, passing through the densest data 
clusters (red line, North America; blue line, Europe and Africa; and black 
line, Asia). b, c, The same as a but depicting centennial proxy anomalies of 
hydroclimate for the seventeenth and the twentieth centuries, respectively. 
d-f, The equivalent analysis for the same centennial periods using 
median-model simulated values, extracted from the same proxy locations, 
of centennial precipitation anomalies (see Methods). 
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Extended Data Figure 8 | Distribution and density of hydroclimate from the data listed in Supplementary Table 1 and shown in Fig. la. 
proxy records. a, Number of contributing hydrological proxy records Anomalies are shown relative to the centennial mean and standard 
included in each proxy-centred anisotropic weighted mean calculation deviation over the eleventh-nineteenth centuries. The colour scale is 
where there are three or more neighbouring proxies found in the search truncated at —2 and 2 and areas with insufficient proxy coverage to 
radius. b, Raw, centennial, hydroclimate proxy anomaly values derived compute a gridded weighted mean value are left white. 
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Extended Data Figure 9 | Histograms of cross-correlations with between gridded temperature proxy data and temperature model (MT) 
kernel density estimate added. a, Cross-correlations between gridded data. e, The same as a but Fisher-transformed. f, The same as b but Fisher- 
hydroclimate proxy data and temperature proxy (PT) data. b, Cross- transformed. g, The same as c but Fisher-transformed. h, The same as d 
correlations between gridded hydroclimate model data and temperature but Fisher-transformed. The thin curves in each histogram represent the 
model data. c, Cross-correlations between gridded hydroclimate proxy kernel density estimates. 


(PH) data and hydroclimate model (MH) data. d, Cross-correlations 
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Extended Data Table 1 | Results from multiple sensitivity tests of our gridded weighted hydroclimate reconstruction 


a Median Pearson’s Median p- 
correlation value 
Reconstruction from only records with at least decadal resolution and 0.97 < 0.01 
centennial or better dating control 
Excluding ice cores 0.86 <0.01 
Excluding peat cores 0.90 < 0.01 
Excluding tree rings 0.77 < 0.01 
Excluding speleothems 0.84 < 0.01 
Excluding documentary data 0.90 < 0.01 
Excluding lake sediments 0.80 < 0.01 
Excluding marine sediments 0.88 < 0.01 
Retaining only calibrated precipitation records 0.89 < 0.01 
Excluding calibrated precipitation records 0.92 < 0.01 
Excluding records with annual signal 0.98 < 0.01 
Excluding records with summer signal 0.77 < 0.01 
Excluding records with winter signal 0.78 < 0.01 
b 
North American Drought Atlas” 0.34 0.02 
Monsoon Asia Drought Atlas”! 0.51 < 0.01 
c 
GHCNS instrumental annual precipitation data 0.66 0.05 


a, Statistics from correlating the full gridded weighted reconstruction at the grid-cell level with reconstructions derived from using different subsets of proxy data including tests removing one proxy 
type at the time. b, Statistics from correlating the full gridded weighted reconstruction at the grid-cell level with PDSI drought atlas reconstructions. ¢, Results from correlating the decadal-resolved 
reconstruction from the proxy subset including only records with at least decadal resolution and centennial or better dating control with the decadal mean gridded 5° x 5° GHCN5 instrumental annual 


precipitation data over the twentieth century. 
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Late Quaternary climate change shapes island 


biodiversity 


Patrick Weigelt!?, Manuel Jonas Steinbauer*, Juliano Sarmento Cabral!* & Holger Kreft! 


Island biogeographical models consider islands either as 
geologically static with biodiversity resulting from ecologically 
neutral immigration-extinction dynamics’, or as geologically 
dynamic with biodiversity resulting from immigration-speciation- 
extinction dynamics influenced by changes in island characteristics 
over millions of years”. Present climate and spatial arrangement 
of islands, however, are rather exceptional compared to most of 
the Late Quaternary, which is characterized by recurrent cooler 
and drier glacial periods. These climatic oscillations over short 
geological timescales strongly affected sea levels** and caused 
massive changes in island area, isolation and connectivity’, 
orders of magnitude faster than the geological processes of island 
formation, subsidence and erosion considered in island theory”®. 
Consequences of these oscillations for present biodiversity remain 
unassessed*”. Here we analyse the effects of present and Last Glacial 
Maximum (LGM) island area, isolation, elevation and climate on 
key components of angiosperm diversity on islands worldwide. We 
find that post-LGM changes in island characteristics, especially 
in area, have left a strong imprint on present diversity of endemic 
species. Specifically, the number and proportion of endemic species 
today is significantly higher on islands that were larger during the 
LGM. Native species richness, in turn, is mostly determined by 
present island characteristics. We conclude that an appreciation of 
Late Quaternary environmental change is essential to understand 
patterns of island endemism and its underlying evolutionary 
dynamics. 

The Quaternary is characterized by marked climatic oscillations 
between glacials and interglacials that had worldwide consequences 
for species distributions and diversity*®. Warm interglacials, like the 
Holocene (past 11,500 years), can be regarded as short-term anoma- 
lies embedded in much longer, cooler and drier glacial periods’. For 


a Hypothesis 1 b Hypothesis 2 


Variable importance 


most of the Late Quaternary, massive amounts of water accumulated in 
continental ice sheets. This lowered sea levels considerably‘ and influ- 
enced size, spatial arrangement and isolation of landmasses. In this con- 
text, the estimated 122 m sea level rise from the LGM (~21,000 years 
before present) to the early Holocene was particularly drastic and rapid? 
(Fig. la). 

Late Quaternary sea level lowstands had far-reaching consequences 
for island geography (Fig. 2). Many islands were much larger and less 
isolated®!°. Neighbouring islands today separated by shallow waters 
formed single landmasses, for example, Fuerteventura and Lanzarote 
made up the larger Mahan’ (Fig. 2d), and currently submerged sea- 
mounts emerged and created stepping-stones for species dispersal!°-?. 
Moreover, many islands were repeatedly connected and disconnected to 
the mainland’*"*. Such profound changes in island geography should 
have strongly affected immigration, speciation and extinction, the core 
processes determining the assembly and diversity of island biota>’. 
Present island diversity thus probably carries an imprint of past physical 
and bioclimatic conditions. 

While the effects of Quaternary climate change on biodiversity 
are increasingly well-recognized for mainlands®*">, they are under- 
represented in island biogeography theory®”. The equilibrium theory of 
island biogeography’, perhaps the most influential framework in island 
biology, assumes species richness to result from a dynamic equilibrium 
between immigration (plus speciation) and extinction. Immigration 
and speciation rates are higher and extinction rates lower on larger 
islands, while immigration rates are lower and extinction and specia- 
tion rates higher on more isolated islands’'**. The equilibrium theory 
assumes island geography not to change, and limitations of this static 
view have widely been recognized!*'°. More recent models emphasize 
the dynamic nature of islands and relate biogeographical rates and bio- 
diversity to island ontogeny-related changes in area and topographical 


c¢ Hypothesis 3 


: oo 
& 
e 

ae e 

e n "= m~ 
e oO = 
e i: ra) = a 
e Endemics oO = — 

Gb Endemics ™ ae 


No. of natives RP ALGM 

© 
= No. of endemics | @ ALGM e Natives 
= 3 
oO = 
o 
6 OF 8 
9 -20,000 years 9 
79) [oy 

op) e 

. e 
T 
-120 ~Now ivy 


T 1 T 
0 50 100 150 


Time (kyr before present) 
Figure 1 | Hypothesized effects of post-LGM sea-level changes on island 
biodiversity. a, We hypothesize that present (P) plant diversity has been 
affected by LGM island characteristics (sea-level curve in a after ref. 3). 
Changes in island characteristics since the LGM (ALGM) should therefore 
be important predictors of present diversity patterns. These effects should 
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be stronger for endemic than for native non-endemic species richness. 
kyr, thousand years. b, c, Hypothesized effects of changes in island area 
(Aarea) (b) and stepping-stone isolation (Aisolation; considering also 
currently submerged seamounts and changes in mainland coastlines) (c) 
from LGM to present (P) on island plant diversity. 
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Figure 2 | Examples of spatial arrangements of present landmasses 
(blue) and modelled landmasses during the LGM (orange). LGM 
coastlines were estimated based on global bathymetry data*’ assuming a 
sea-level decrease of 122 m at 20,000 years before present* (see Methods). 
Present coastlines were taken from the GADM database of Global 
Administrative Areas (http://www.gadm.org/version1). a, Puerto Rico and 
British Virgin Islands. b, Hawaiian Islands. c, Seychelles. d, Canary Islands. 
Note the different changes in area, isolation and connectivity of islands. 
Exemplary islands mentioned in the text are named. 


17° Ww 16° 15°W 14° Ww 
1 1 1 1 


heterogeneity over long geological time scales”*. Also shallow-time 
changes (that is, changes over short geological time scales) in island 
area, isolation, elevation and climate caused by Late Quaternary oscil- 
lations, have recently been proposed to have influenced immigration, 
speciation and extinction, but whether this has left an imprint in global 
island biodiversity remains untested. 

Here, we evaluate present and shallow-time past drivers of species 
richness and endemism of angiosperms (flowering plants) on 184 
islands worldwide. We hypothesize post-LGM changes in physical 
(area, isolation, elevation) and bioclimatic (temperature, precipitation) 
island characteristics to be strong predictors of present angiosperm 
diversity (hypothesis 1; Fig. 1a). Especially endemic species richness, 
which relates to evolutionary dynamics, should reflect past physical 
and bioclimatic conditions’. Considering that islands were larger dur- 
ing the LGM, islands with greater area change (Aarea) should har- 
bour both more native and endemic species than predicted by their 
present size (hypothesis 2; Fig. 1b). Islands with greater changes in 
isolation (Aisolation) should harbour more native, but fewer endemic 
species (hypothesis 3; Fig. 1c), owing to higher LGM connectivity of 


100 | NATURE | VOL 532 | 7 APRIL 2016 


landmasses increasing inter-island dispersal as well as colonization 
from and to the mainland. 

We estimated LGM coastlines of islands and continents assuming a 
eustatic sea level drop of 122m (ref. 4). The 184 studied islands experi- 
enced strong post-LGM changes in most physical and bioclimatic con- 
ditions owing to warming climate and resulting sea level rise (Fig. 2 and 
Extended Data Fig. 1). Island area decreased by 5,035 km? and 71.1% 
on average (ranging from —4 to —61,377km7; —6.3% to —99.997%). 
Islands that are currently small, but merged with other islands during 
the LGM, experienced particularly strong proportional reductions in 
area (for example, —99.8% for Anegada, British Virgin Islands; —99.6% 
for Mahé, Seychelles; Fig. 2a, c). These post-LGM changes have been 
much faster than average rates of geologic processes of island forma- 
tion, subsidence, and erosion (Extended Data Fig. 2). Island isolation 
measured as distance to the nearest landmass >100,000 km? account- 
ing for present stepping-stone islands” and currently submerged 
seamounts increased on average by 107km (+3 to +513 km; Extended 
Data Fig. 1). Annual temperatures increased on average by 2°C and 
annual precipitation by 163 mm changing in both directions (—2,836 
to +2,684mm). Changes in elevation were high if a focal island was 
connected to a high-elevation island (+327 m on average). 

Regression analyses showed that present island biodiversity is 
strongly related to post-LGM changes in island characteristics (Fig. 3), 
supporting hypothesis 1. Endemism measured as the number and 
proportion of endemic species, in particular, was strongly affected by 
LGM.-related variables (sum of scaled Akaike’s information criterion- 
weighted standardized regression coefficients >0.5; Fig. 3a) and could 
be well explained by a combination of post-LGM changes and present 
island characteristics (adjusted deviance explained (D724) all > 0.8; 
Extended Data Table 1). Effects of area change via sea-level change on 
endemism were strong, whereas direct effects of changes in tempera- 
ture and precipitation were weak. Species richness, in contrast, was 
mostly related to present environmental characteristics, and post-LGM 
changes showed only minor effects (Fig. 3a). The strong effects of pres- 
ent environments on species richness might be explained by diversity 
dynamics closely following climate-induced changes”, suggesting that 
overall species richness on islands is close to equilibrium with present 
conditions. 

Island area was the variable that changed most since the LGM, and 
the correlation between LGM and present area was weak (r= 0.32, 
P<0.001; Extended Data Fig. 3). Area is a key variable in island bio- 
geography: first, larger islands sustain larger populations resulting in 
lower extinction rates'; second, larger islands receive more colonizers 
via target-area effects'’; third, large islands have higher within-island 
speciation rates'®?!”? owing to more heterogeneous environments 
(correlation between area and elevation: r= 0.74, P< 0.001), which is 
supported here by strong effects of present elevation on number and 
proportion of endemic species’ (Fig. 3a and Extended Data Table 1). 
Moreover, large, high-elevation islands may be refugia for once wide- 
spread lowland species and montane species that became extinct on 
lower islands*’. Accordingly, species richness and particularly rich- 
ness and proportion of endemics were higher on islands that were 
larger during the LGM (Fig. 3b and Extended Data Fig. 4), supporting 
hypothesis 2. However, the positive effect of Aarea on species richness 
was small (Fig. 3b), indicating that the number of species on a shrunken 
island is only slightly larger than expected from its present area. 
Considering this and that island floras might have been unsaturated 
during the LGM”, a potential post-LGM extinction debt” seems to 
have a minor role. 

The effect of Aarea on native species richness emerged mainly from 
a strong effect of Aarea on endemic richness (single-island endemics 
(SIE) in Fig. 3 and Extended Data Table 1). With endemic species 
excluded, in fact, the slope of the Aarea-richness relationship was 
even shallower (standardized regression coefficient and variable impor- 
tance of Aarea: endemics > natives > native non-endemics; Fig. 3a, b). 
Island endemics usually result from in situ differentiation rather than 
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Figure 3 | Predictors of angiosperm diversity on 184 islands worldwide. 
a, Relative importance of present island characteristics (blue) and post- 
LGM changes (ALGM; orange) as predictors of angiosperm diversity, 

that is, number of native species (S), native non-endemics (N), single- 
island endemics (SIE), species endemic to past island units (PIE), and 
proportions of single-island endemics (pSIE) and species endemic to 

past islands (pPIE). ALGM variables quantify the difference in island 
characteristics between LGM and today. No. of entities, number of present 


from extinction elsewhere'’. Islands repeatedly larger in the past may 
hence have been arenas of within-island speciation causing a strong 
imprint in present endemism patterns. Consequently, endemic spe- 
cies may inhabit islands presently too small for frequent within-island 
isolation and associated in situ differentiation!®. A further explana- 
tion for the higher number and proportion of endemics on formerly 
larger islands would be a lower extinction risk of endemics relative to 
native non-endemic species. Species evolved on islands might be pre- 
adapted to smaller population sizes and dynamic island environments, 
and might be less affected by decreasing area, which is in contrast to 
previous interpretations”®. 

Contrary to hypothesis 3, Aisolation had no effect on native and 
endemic species richness and the proportion of endemics (Fig. 3a, c 
and Extended Data Fig. 4). The difference between present and LGM 
island isolation was small and both were strongly related (r =0.997, 
P<0.001; Extended Data Fig. 3). Furthermore, most islands in our 
study are quite isolated from the mainland (mean = 1,427 km) and 
consequently immigration events are generally rare. For Hawaiian 
plants, for instance, the net colonization rate (disregarding extinction) 
is estimated at one colonizer every 98,000 years”’. Sea-level changes 
may therefore have only weakly affected rates of long-distance disper- 
sal from major landmasses, even though isolation undoubtedly affects 
immigration’ and speciation’. 

Probably more importantly, inter-island dispersal was affected by 
post-LGM changes in within-archipelago connectivity'!**. Accordingly, 
we found a strong negative effect of the number of present islands that 
merged during the LGM on the number of single-island endemics (SIE 
in Fig. 3a and Extended Data Fig. 5). This means that islands that only 
lost area since the LGM have on average more single-island endemics 
than similar sized-islands that were connected to other islands, indi- 
cating a strong role of past connectivity for intra-archipelagic dispersal. 


island entities that were part of single LGM islands; +-/—, direction 

of important effects. b, c, Effects of post-LGM changes in island area 
(Aarea) (b) and stepping-stone isolation (Aisolation) (c) controlling 
for all covariables in generalized linear models. Relationships and 95% 
confidence intervals (dashed) are shown for S (black), N (blue), SIE 
(light green) and PIE (green). **P < 0.01, ***P < 0.001, NS, not 
significant (P > 0.05). 


Inter-island dispersal of single-island endemics immediately changes 
their endemism status leading to fewer single-island endemic species 
on formerly connected islands. Accordingly, when considering species 
endemic to the past island entities, the effect of the number of present 
islands decreased (compare past-island endemics (PIE) in Fig. 3a and 
Extended Data Fig. 5). However, the still significant negative effect con- 
tradicts the notion of repeated fusion and fission of islands acting as 
speciation pump”’, which might at the given spatial and temporal scale 
only be important for certain taxonomic groups of low dispersal ability 
(compare ref. 10). Dispersal and gene flow between formerly connected 
islands appears to impede rather than promote speciation in plants. The 
strong positive effect of Aarea on endemic richness hence suggests that 
speciation is promoted in cases of high environmental opportunity and 
intra-archipelagic connectivity without actual land-bridge connections. 
Considering these area changes and fusion-fission dynamics may also 
help to understand additional biogeographical patterns and processes, 
for example, why certain lineages or functional groups diversify more 
frequently on islands than others”. 

In conclusion, sea-level changes during the Late Quaternary have 
left a strong imprint on the present distribution of island endemics. 
Especially, the strong effects of changes in area and intra-archipelagic 
connectivity on endemism suggest a heretofore underestimated 
legacy of LGM conditions in immigration, speciation and extinc- 
tion dynamics on islands. Island biogeography theory hence needs 
to incorporate the effects of shallow-time geographic dynamics on 
inter-island dispersal and diversification”. Future island research 
should also focus on other, poorly understood LGM-associated 
shifts in, for example, ocean currents, wind systems, or source areas. 
Considering the disproportional contribution of island endemics to 
global plant diversity*°, these dynamics have profound implications 
for global biodiversity. 
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METHODS 


Island data. We estimated last glacial maximum (LGM) coastlines of islands and 
mainlands based on ETOPO! global bathymetry data*! at 1-arcminute resolution 
assuming a eustatic sea level decrease of 122 m at 20,000 years before present* 
(Fig. 2). High-resolution present coastlines were taken from the GADM database 
of Global Administrative Areas, version 1 (http://www.gadm.org/version1). Past 
and present island area and isolation were derived from the respective coastline 
data (Fig. 2). To account for present stepping-stone islands as well as seamounts 
that emerged as potential stepping-stones during the LGM, we used the shortest 
distance over water from an island to the nearest landmass larger than 100,000 km? 
as isolation metric (metric “D9, in ref. 12). Calculating isolation as the proportion 
of water in the surrounding of each island as suggested by ref. 12 led to lower val- 
ues of present isolation compared to the LGM when large neighbouring islands 
were merged in the past (Extended Data Fig. 3). We therefore opted for using 
stepping-stone isolation as isolation metric. In addition, the number of present 
island entities covered by species checklists that formed part of LGM island units 
entered the analyses as a measure of past inter-island connectivity (no. of entities). 
Island elevation of both present and past islands was derived from a global digital 
elevation model”. For past islands, elevation was the maximum elevation of all 
present islands included in the past island unit plus 122 m. 

During the past one million years, sea levels repeatedly fell below —100m 
and today’s sea level represents one of few highstands*. The island characteristics 
calculated for the LGM are hence proxies for recurrent conditions that might 
have shaped today’s island biotas over at least one million years. In addition to 
global eustatic sea-level changes, oceanic island subsidence and loading of the 
water column may affect regional sea levels and therefore also the area, isolation 
and elevation of oceanic islands at the relevant time scale of millennia’®"'. At the 
global scale, however, these effects are currently difficult to address, and, compared 
to the magnitude of eustatic sea-level changes, their effects are small!" Eustatic 
fluctuations are hence the most relevant driver of recurrent sea-level changes at 
the global extent and at the relevant time scale (compare Extended Data Fig. 2). 

Maximum values of present annual mean temperature and annual precipitation 
per island were derived from WorldClim'”. For past climatic variables, hindcasts 
for 21,000 years before present were derived from the CCSM3 and MIROC3.2 
models**, downscaled to 30 arcseconds. We averaged the two model predictions as 
LGM mean annual temperature and precipitation according to refs 9, 14. Maximum 
values of past annual mean temperature and annual precipitation were extracted 
based on all present islands belonging to a past island unit. Note that climate hind- 
casts for islands are considered less reliable compared to mainland hindcasts**. We 
therefore focus our interpretations on the physical island characteristics area, iso- 
lation and number of island entities merged in the past and only consider climatic 
changes as covariables. In addition to natural changes in island characteristics, 
human-induced changes since the LGM influenced island floras**. However, here 
we only considered native species and whenever possible included species that 
recently went extinct owing to human activity to focus on natural drivers of island 
plant diversity. 

For 184 oceanic islands worldwide that were not connected to the mainland dur- 
ing the LGM", we derived angiosperm species richness of native species (S), native 
non-endemic species (N), species endemic to single present island entities (SIE), 
and species endemic to all islands that were comprised by single island entities 
during the LGM (PIE), based on a global database of island floras*®?” (Extended 
Data Fig. 6). No statistical methods were used to predetermine sample size. The 
subset of the database used here comprised species lists from published floras, 
checklists and online databases***! as well as data from original research papers 
provided by the authors!©?>"-"?, Infra-specific endemics were not considered. As 
native non-endemic species, we considered species neither endemic to present nor 
to past island entities (for correlations among diversity metrics see Extended Data 
Fig. 7). Islands were included only if all islands of the past island unit were covered 
by species lists and islands were excluded if their checklists referred to more than 
one past island unit (for example, Puerto Rico). In total, the data set included 
17,626 species including 6,257 island endemics. All species names were subjected 
to a comprehensive taxonomic standardization based on The Plant List (version 
1.1, http://www.theplantlist.org) and iPlant’s taxonomic name resolution service 
(version 3.2, http://tnrs.iplantcollaborative.org), following ref. 36. 

Statistical analysis. We used present island characteristics and the differences 
between past and present conditions (LGM values — present values, depicted by A) as 
predictor variables for plant diversity and endemism. Because isolation and tempera- 
ture increased since the LGM, we multiplied Aisolation and Atemperature by —1 
to be able to relate increases in environmental change to the diversity metrics. Area, 
precipitation, elevation, Aarea, Aisolation and Aelevation were logo-transformed 
after adding 1 in the case of elevation and Aarea to normalize their distributions 
and to moderate differences between very large values compared to differences 
between smaller values. Present island characteristics were only weakly correlated 
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to the respective differences between past and present (all |r| < 0.44; Extended 
Data Table 2). 

We modelled species numbers (S, N, SIE, PIE) using generalized linear models 
with a Poisson distribution. For proportions of endemic species (pSIE, pPIE), 
we used models with a binomial distribution where endemic species and native 
non-endemic species entered as composite response variables to account for 
differences in species richness. Since spatial autocorrelation in model residuals 
might violate the assumptions of generalized linear models”, we included a spatial 
autocovariate built from residuals of the non-spatial model®. The autocovariate 
was computed using the function autocov_dist from the R package spdep” using 
an optimized spatial neighbourhood structure with a neighbourhood distance of 
1,000km!*°”, We calculated models with all possible predictor variable combina- 
tions and selected the best models from all candidate models based on Akaike's 
information criterion (AIC) (Extended Data Table 1)”*. For graphical representa- 
tion of the relationships between post-LGM changes and diversity metrics, we used 
full-model predictions including all predictor variables (Fig. 3b, c and Extended 
Data Figs 4 and 5). Regression lines were estimated with covariables held constant 
at the mean of their empirical values across the islands. As a measure of varia- 
ble importance, we used AIC-weighted standardized regression coefficients. The 
standardized regression coefficients of all candidate models were weighted by the 
models’ AIC weights and summed up for all possible models including a given 
variable’®**, The variable importance values of all predictor variables were then 
scaled to sum up to one for comparison across models of richness of native and 
endemic species and proportions of endemic species (Fig. 3a). 

Analyses were performed using ArcGIS 9.3 (ESRI, Redlands, USA) and the R 
statistical programming language (version 3.1.2, R Core Team). All diversity met- 
rics, present bioclimatic and physical characteristics as well as post-LGM changes 
as used in the statistical analyses can be found in the Supplementary Data. 
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Extended Data Figure 2 | Geologic age and area of the four main 
geologic complexes of the Hawaiian Islands exemplifying the rate of 
area decrease with time due to subsidence and erosion for volcanic 
islands. In this example scenario, we assume that the four major 
complexes of the Hawaiian Islands (Fig. 2b) all reached approximately 
the same maximum area size and that the youngest complex, Big Island, 
already reached its full extent. The area decrease from Big Island to 

Maui Nui equals —0.0072 km? per year and from Maui Nui to O’ahu 
—0.0016 km? per year (see map in Fig. 2b). The linear fit over all points 
has a slope of —0.0016 km? per year. The negative exponential curve has 
a slope of —0.0034 km’ per year at the beginning and of —0.0006 km? 

per year at the end. For comparison, the 184 islands used for this paper 
experienced an area decrease due to rising sea levels of 0.5035 km? per 
year over 10,000 years on average and the post-LGM decrease in island 
area for islands like Anegada or Mahé, was up to 1,000 times faster than 
the area decrease indicated here for the Hawaiian Islands. The notion that 
post-LGM changes have been much faster than average rates of geologic 
processes of island formation and erosion, therefore, most likely holds true 
even if the assumptions above are not perfectly met. 
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Extended Data Figure 4 | Effects of post-LGM changes in biophysical 
island characteristics on the proportion of endemic species on 

184 islands worldwide. a, b, Regression lines were predicted for changes in 
area (Aarea) (a) and stepping-stone isolation (Aisolation) (b) after accounting 
for all past and present covariables which were held constant at the mean of 
their empirical values across the islands (see Fig. 3 and Extended Data Fig. 5). 
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Relationships are shown for single-island endemics (light green) and 
species endemic to past island units (differs from single-island endemics 
in cases where several present islands originated from one island during 
the last glacial maximum) (green). Dashed lines indicate 95% confidence 
intervals. 
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Extended Data Figure 5 | Effects of post-LGM changes in biophysical 
island characteristics on angiosperm diversity on 184 islands 
worldwide. Post-LGM changes (with A) give the difference in island 
characteristics from the LGM to today. No. of entities, number of 
present-island entities that made up single past island units during 

the LGM. Regression lines were predicted after accounting for all past 
and present covariables which were held constant at the mean of their 
empirical values across the islands (for Aarea and Aisolation see Fig. 3 
and Extended Data Fig. 4). In a, c, e and g, relationships are shown for 
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species numbers of natives (black), native-non-endemics (blue), 
single-island endemics (light green) and species endemic to past island 
units (green; differs from single-island endemics in cases where several 
present islands originated from one island during the LGM). In b, d, f 

and h, relationships are shown for proportions of single-island endemics 
(light green) and species endemic to past island units (green). Dashed lines 
indicate 95% confidence intervals. *P < 0.05, **P < 0.01, ***P< 0.001, 
n.s., not significant (P > 0.05). 
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Extended Data Figure 6 | Species richness of native and endemic proportion of species endemic to past island units one a, aa 
angiosperms on the 184 islands worldwide used for this study. richness is given in numbers of species. Numbers in legends indicate 

a-f, Native species (a), single-island endemics (SIE) (b), past-island-unit category borders. World maps are based on the GADM database of 
endemics (PIE; differs from SIE in cases where several present islands Global Administrative Areas, version 1 (http://www.gadm.org/version1). 
originated from one island during the last glacial maximum) (c), native See Supplementary Data for values and see Methods for references used to 
non-endemics (d), proportion of single-island endemics (pSIE) (e), and compile the data set. 
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Extended Data Figure 7 | Relationships and Pearson correlation 
coefficients (r) of native and endemic species richness of angiosperms 
on 184 islands worldwide. a-c, Correlations of native species richness 
(S) with native non-endemics (N) (a), single-island endemics (SIE) (b) 
and past-island-unit endemics (PIE; differs from SIE in cases where 
several present islands originated from one island during the last glacial 
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maximum) (c). d-f, Correlations of native non-endemic species richness 
(N) with single-island endemics (SIE) (d) and past-island-unit endemics 
(PIE) (e) as well as richness of single-island endemics (SIE) with past- 
island-unit endemics (PIE) (f). Diagonals indicate lines of equal values. 
FEED < (0,001. 
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Extended Data Table 1 | Model statistics for best candidate models from AlC-based model selection for species richness of native (S), native 
non-endemic (N), single-island endemic (SIE), and past-island-unit endemic (PIE) angiosperm species and proportions of endemic species 
(pSIE, pPIE) in dependence on past and present island characteristics 


Ss Estimate Std.error z-value P__ pseudo-R? 
Intercept 5.058 0.006 782.678 *** 

logio Area 0.413 0.010 41.389 *** Dadi 
Isolation -0.521 0.007 -77.766 =*** 0.934 
Temperature 0.344 0.007 51.157. = 

logio Precipitation 0.320 0.006 52.234 *** McFadden 
logio Elevation 0.178 0.010 18.713 *** 0.919 
logio AArea 0.262 0.011 24.164 *** 

No. of entities -0.167 0.011 -15.422 *** Efron 
Spatial autocovariate 0.064 0.001 81.514 *** 0.969 
N 

Intercept 4.993 0.007 735.901 *** 

logio Area 0.421 0.007 57.625 ** D?agj 
Isolation -0.576 0.007 -80.661 *** 0.916 
Temperature 0.306 0.007 42.926 *** 

logio Precipitation 0.331 0.007 47.765 *** McFadden 
logio Elevation 0.147 0.010 15.289 *** 0.896 
logio AArea 0.085 0.007 12.520 *** 

logio Alsolation -0.025 0.006 4.153 *** Efron 
Spatial autocovariate 0.068 0.001 81.542 *** 0.950 
SIE 

Intercept 1.372 0.051 27.114 *** 

logio Area 0.039 0.058 0.669 D?aqj 
Isolation -0.238 0.026 9.158 *** 0.921 
Temperature 0.608 0.028 21.802 *** 

logio Precipitation 0.088 0.022 4.000 *** McFadden 
logio Elevation 1.045 0.061 TE216: 0.913 
logio AArea 1.266 0.063 19.947 *** 

logio Alsolation -0.059 0.019 -3.072 ** Efron 
ATemperature -0.164 0.028 5.851  *** 0.986 
APrecipitation -0.088 0.019 4.603 *** 

logio AElevation -0.419 0.059 -7.058 *** 

No. of entities -1.821 0.102 -17.894 *** 

Spatial autocovariate 0.229 0.007 33.276 *** 

pSIE 

Intercept -3.636 0.051 -71.385  *** 

logio Area -0.499 0.065 7.721 *** D?agj 
Temperature 0.241 0.033 7.286 *** 0.830 
logio Precipitation -0.265 0.025 -10.554 *** 

logio Elevation 1.205 0.068 17.670 *** McFadden 
logio AArea 0.736 0.061 11.974 *** 0.809 
ATemperature -0.252 0.025 -10.023 *** 
APrecipitation -0.223 0.019 -11.631 *** Efron 
logio AElevation -0.254 0.060 4.254 *** 0.596 
No. of entities -1.766 0.103 -17.165 *** 

Spatial autocovariate 0.233 0.006 39.781 *** 

PIE 

Intercept 2.190 0.029 74.3770 *** 

logio Area 0.260 0.045 5.834 *** D?agj 
Isolation -0.068 0.025 -2.746 ** 0.918 
Temperature 0.576 0.024 24.225 *** 

logio Precipitation 0.062 0.021 2.979 ** McFadden 
logio Elevation 0.762 0.047 16.244 *** 0.906 
logio AArea 1.080 0.048 22.715 =*** 

logio Alsolation -0.034 0.017 -1.969 * Efron 
ATemperature 0.012 0.024 0.497 0.986 
APrecipitation -0.201 0.015 -13.144 *** 

No. of entities -0.542 0.038 14.313 *** 

Spatial autocovariate 0.230 0.005 46.127 *** 

pPIE 

Intercept -2.872 0.031 -91.579 *** 

logio Area -0.177 0.054 -3.265 ** ID? 
Temperature 0.319 0.027 11.829 *** 0.805 
logio Precipitation -0.171 0.023 7.341 *** 

logio Elevation 0.682 0.050 13.521 *** McFadden 
logio AArea 0.732 0.059 12.353", 0.777 
logio Alsolation 0.193 0.021 9.071 *** 
ATemperature -0.165 0.024 6.885 *** Efron 
APrecipitation -0.188 0.018 -10.562  *** 0.520 
logio AElevation -0.031 0.036 -0.860 

No. of entities -0.512 0.045 -11.388  *** 

Spatial autocovariate 0.305 0.007 44.398 *™* 


Variables that quantify the change in island characteristics between the LGM and present are depicted by A. No. of entities represents the number of present island entities that made up single past 
island units during the LGM due to lower sea levels. Estimates are based on zero-mean-unit-variance standardized predictor variables for comparison (except for the spatial autocovariate). All best 
models were characterized by AIC values more than 7 units smaller than those for the second-best models. As measures of variability explained we provide pseudo-R? values: Daa), deviance explained 
adjusted for sample size and number of predictors??, McFadden, McFadden’s pseudo R? (ref. 100), Efron, R?-value of a linear model between the observed and predicted values of the generalized 
linear models (GLM)!°!, Please note that Efron’s pseudo-R? values for Poisson GLMs (S, N, SIE, PIE) probably overestimate the variability explained as the linear models between predicted and 
observed values were mainly driven by few large values. *P<0.05, **P<0.01, ***P<0.001. 
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Extended Data Table 2 | Matrix of Pearson correlation coefficients of all 11 past and present environmental predictors used in this study 
(n= 184 islands) 


ri Tempera- logio Pre- logio logio ATempera- APre- logio 
legumes. | teolaton ture cipitation _ Elevation logioAArea A jcolation ture cipitation _ AElevation 
Isolation -0.060 
Temperature -0.121 0.127 # 
logie =. 0.178 * 0.420** 0.128 # 
Precipitation 
logio Elevation 0,735'*** 0.062 -0.207 ** 0.218 ** 
logio AArea 0.188 * -0.334 *** 0.056 0.055 -0.074 
logio ee ek 
Alsolation 0.008 -0.036 0.013 0.482 -0.085 0.461 
ATemperature -0.004 -0.364 *** -0.439 *** -0.131 # 0.141 # -0.049 -0.003 
APrecipitation -0.353 *** 0.220 ** 0.149 * -0.043 -0.255 *** 0.060 0.004 -0.205 ** 
logio ee * fo ee m he 
AElevation -0.371 -0.179 0.145 # -0.133 # -0.378 0.602 0.200 -0.127 # 0.387 
No. of entities -0.350 *** -0.208 ** 0.206 ** 0.032 -0.374 *** 0:715 *** 0.399 *** -0.043 0.297 *** 0.704 *** 


Past variables are depicted by A and give the difference between island characteristics during the last glacial maximum and today. No. of entities represents the number of present island entities that 
made up single past island units during the LGM due to lower sea levels. #P < 0.1, *P<0.05, **P < 0.01, ***P<0.001. 
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A specific area of olfactory cortex involved in 
stress hormone responses to predator odours 


Kunio Kondoh!*, Zhonghua Lu'*, Xiaolan Ye!, David P. Olson*+, Bradford B. Lowell? & Linda B. Buck! 


Instinctive reactions to danger are critical to the perpetuation of 
species and are observed throughout the animal kingdom. The 
scent of predators induces an instinctive fear response in mice 
that includes behavioural changes, as well as a surge in blood 
stress hormones that mobilizes multiple body systems to escape 
impending danger’”. How the olfactory system routes predator 
signals detected in the nose to achieve these effects is unknown. 
Here we identify a specific area of the olfactory cortex in mice that 
induces stress hormone responses to volatile predator odours. 
Using monosynaptic and polysynaptic viral tracers, we found that 
multiple olfactory cortical areas transmit signals to hypothalamic 
corticotropin-releasing hormone (CRH) neurons, which control 
stress hormone levels. However, only one minor cortical area, 
the amygdalo-piriform transition area (AmPir), contained 
neurons upstream of CRH neurons that were activated by volatile 
predator odours. Chemogenetic stimulation of AmPir activated 
CRH neurons and induced an increase in blood stress hormones, 
mimicking an instinctive fear response. Moreover, chemogenetic 
silencing of AmPir markedly reduced the stress hormone response 
to predator odours without affecting a fear behaviour. These 
findings suggest that AmPir, a small area comprising <5% of 
the olfactory cortex, plays a key part in the hormonal component 
of the instinctive fear response to volatile predator scents. 

Animals sense environmental dangers that challenge their survival 
and respond with behavioural and physiological changes to eliminate 
the challenge. The olfactory system detects chemical cues emitted 
from predators that signal danger and stimulate an instinctive fear 
response. In addition to characteristic behaviours, this response 
includes increases in blood stress hormones that are reminiscent of 
responses to fear and stress in humans and are controlled by CRH 
neurons in the paraventricular nucleus of the hypothalamus (PVN) Pe. 
The neural circuits that convey predator odour signals from the nose 
to CRH neurons are presently unknown. 

Volatile predator cues are detected in the olfactory epithelium of the 
nose, which also detects common odorants, whereas fear-stimulating 
proteins are detected in the vomeronasal organ (VNO), an accessory 
olfactory structure*°. Sensory signals travel from the nose through 
the olfactory bulb to the olfactory cortex (OC) and from the VNO 
through the accessory olfactory bulb to the vomeronasal amygdala 
(VA)®’. The mouse nose has ~1,000 different odorant receptors, each 
expressed by a unique subset of dispersed sensory neurons*!!. The 
olfactory bulb contains a semi-stereotyped map of receptor inputs 
in which individual glomeruli and projection neurons appear to be 
dedicated to one receptor'?"4, but single bulb neurons can project 
widely in the OC*!”, The OC comprises multiple distinct areas whose 
respective functions are largely obscure®’. Recent studies have linked 
an amygdala OC area to odour attraction and aversion’*. We investi- 
gated whether a specific OC area governs stress hormone responses 
to predator odours. 


We first asked which OC areas can transmit signals to CRH neurons. 
We developed two Cre-recombinase-dependent Bartha pseudora- 
bies viruses (PRVs) that have nonfunctional thymidine kinase (TK) 
genes, but require TK to replicate and travel retrogradely across 
synapses!®”°, and a lentivirus (LVF2TK) with Cre-dependent expres- 
sion of haemagglutinin-tagged TK (HA-TK) (Fig. la). PRVB177 is 
a ‘polysynaptic PRV’ with irreversible Cre-dependent expression of 
HA-TK that can cross multiple sequential synapses. PRVB316 is a 
‘monosynaptic PRV’ with Cre-dependent expression of GFP that can 
replicate in neurons co-infected with LVF2TK and spread across one 
synapse, but not further. We injected the viruses into the PVN of mice 
that express Cre in CRH neurons (CRH-IRES-Cre (CRH-Cre) mice)”! 
(Fig. 1b) and then immunostained brain sections for HA (PRVB177) 
or GFP (PRVB316) to visualize virus-infected neurons. 

Preliminary experiments confirmed that Cre is expressed in PVN 
CRH neurons in CRH-Cre mice and that the viruses function as 
expected. In CRH-Cre mice crossed with Rosa-floxstop-GFP mice, 
93.9 +0.5% of GFP* cells expressed Crh (n=4) (Extended Data 
Fig. la). After PVN co-injection of PRVB316 and LVF2TK, some 
neurons expressed both viruses (Extended Data Fig. 1b). Neurons 
infected with PRVB316, but not LVF2TK, were subsequently detected 
outside the PVN, whereas none were detected outside the PVN when 
PRVB316 was injected without LVF2TK (Extended Data Fig. 1c). No 
infected neurons were seen in wild-type animals injected with the 
monosynaptic or polysynaptic PRV (Extended Data Fig. Ic, d). 

Virus-infected neurons were detected in multiple brain areas after 
PRV infection of CRH neurons (Extended Data Figs 2 and 3). The 
monosynaptic PRV (PRVB316) infected 31 brain areas, including areas 
known to project axons to the PVN” and areas linked to CRH neuron 
activation by certain non-olfactory stressors’. 

In the olfactory system (Fig. 1c), PRVB316 infected only the 
posterior medial amygdala (MEAp), part of the VA (Fig. 1d, e), 
whereas, by day 4 after injection, PRVB177 infected five OC and both 
VA areas (Fig. 1d, f, g). Most brain areas infected with PRVB177 on 
day 3 were also infected by PRVB316 (Extended Data Figs 2 and 3), 
suggesting that OC areas infected on day 4 may transmit signals to 
CRH neurons via one intermediate relay. 

We observed a regional bias in the locations of neurons upstream 
of CRH neurons in some olfactory areas, but not others. PRVB177* 
neurons were randomly distributed along the anterior—posterior axis 
in most olfactory areas, but were concentrated in posterior regions of 
the posterior piriform cortex (pPir) and AmPir and in the posteroven- 
tral quadrant of the MEA (Extended Data Fig. 4). 

Dual labelling for PRVB177 and markers of excitatory glutamater- 
gic neurons (Vglut1 (also known as Slc17a7) and Vglut2 (also known 
as SIlc17a6)) or inhibitory GABAergic neurons (Gad1 and Gad2) 
indicated that some neurons upstream of CRH neurons in the MEA 
were GABAergic, but the majority of those labelled for the markers in 
higher olfactory areas were glutamatergic (Extended Data Fig. 5). The 
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Figure 1 | Higher olfactory areas contain neurons upstream of CRH 
neurons. a, Cre recombinase causes irreversible expression of HA-TK 
from PRVB177 and LVF2TK, and GFP from PRVB316. gG, gG locus; Pemv, 
cytomegalovirus promoter; pA, polyadenylation signal; WPRE, woodchuck 
hepatitis post-transcriptional regulatory element. b, Viruses were injected 
into the PVN (blue) of CRH-Cre mice. c, The OC (top) comprises multiple 


glutamatergic neurons could potentially activate either excitatory or 
inhibitory neurons directly presynaptic to CRH neurons in other brain 
areas, and thus either stimulate or inhibit CRH neurons. 

Neurons upstream of CRH neurons in different OC areas could con- 
ceivably transmit redundant predator odour signals to CRH neurons. 
To investigate this possibility, we used a neural activity marker, nuclear 
Arc messenger RNA (nArc)*? to identify PRVB177-infected neurons 
activated by the fox predator odorant 2,5-dihydro-2,4,5-trimethylthi- 
azoline (TMT)*”4 or bobcat (predator) urine. Preliminary experiments 
confirmed that TMT and bobcat urine, but not non-predator (rabbit) 
urine, induce the activity marker c-Fos in PVN CRH neurons and 
increase blood levels of the stress hormones adrenocorticotropic 
hormone (ACTH) and corticosterone (Extended Data Fig. 6). 

Surprisingly, both predator odours induced a significant increase 
in nArc in PRV* neurons in only one OC area, AmPir (Fig. 2). The 
percentage of AmPir PRV~ neurons expressing nArc was increased 
5.8-fold by TMT and 4.9-fold by bobcat urine, whereas rabbit urine had 
no significant effect. These results suggested that AmPir could play an 
important part in hormonal fear responses to volatile predator odours. 

To explore this idea, we first asked whether chemogenetic activation 
of AmPir induces stress hormone increases. We injected the adeno- 
associated virus (AAV), AAV-DIO-hM3Dq-mCherry” bilaterally into 
the AmPir, pPir or MEA of mice expressing Cre in projection neurons 
(Emx1-IRES-Cre or Vglut2-IRES-Cre mice)*®”” (Fig. 3a—d). This virus 
has Cre-dependent expression of mCherry fused to hM3Dq, a receptor 
that depolarizes neurons upon binding its pharmacologically inert 
ligand, clozapine-N-oxide (CNO)”®. We then injected animals with 
CNO or saline and measured plasma ACTH. 

Chemogenetic activation of AmPir induced a striking (7.6-fold) 
increase in plasma ACTH (Fig. 3e). Activation of MEA, but not pPir, 
also increased plasma ACTH. Consistent with these results, activation 
of AmPir or MEA also induced c-Fos in PVN CRH neurons (Fig. 3f, g). 
Additional analyses showed that mCherry* (AAV-infected) neurons 
were present in a large part of each targeted area (Extended Data Fig. 7) 
and that CNO induced nArc in neurons in each injected area (Fig. 3h, i). 
Those in the AmPir could include neurons responsive to TMT or bob- 
cat urine as well as neurons activated by other odorants. Owing to the 
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areas and the VA (bottom) two areas. See Methods for full names. d, Mean 
percentages of PRV™ neurons in olfactory areas 4 days after injection 

of PRVB316 (n= 24) or PRVB177 (n=6). Error bars indicate standard 
error of the mean (s.e.m.). e-g, Photographs and diagrams of olfactory 
area (blue) sections*’ (red rectangles) with neurons immunostained for 
PRVB316 (e) or PRVB177 (f, g) (green). Scale bars, 200 1m. 


relatively small size of the AmPir, infected neurons were also often 
seen in immediately adjacent olfactory areas (parts of the posterior 
posterolateral cortical amygdala (PLCo) and lateral posteromedial 
cortical amygdala (PMCo)). These results indicated that activation 
of neurons in and around AmPir induces a stress hormone response 
that mimics the instinctive fear response. 

We next used chemogenetic silencing to ask whether the AmPir is 
required for stress hormone responses to predator odours. We injected 
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Figure 2 | Predator odours activate neurons upstream of CRH neurons 
in AmPir. a, AmPir neurons co-labelled (arrowheads) for PRVB177 
(green) and nArc (red) after exposure of mice to bobcat urine or TMT. 
Scale bar, 501m. b, Percentage of PRVB177~* neurons co-labelled for nArc 
in areas of the OC (left) and VA (right) after exposure to TMT, bobcat 
urine, rabbit urine or water. n = 5-6 per condition. Error bars indicate 
s.e.m. *P< 0.05, **P< 0.01, Kruskal-Wallis test with post-hoc Dunn’s test. 
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Figure 3 | Chemogenetic activation of the AmPir induces stress 
hormone increase. a~d, mCherry immunostaining (red) after injection 
of AAV-DIO-hM3Dq-mCherry (a) into the AmPir (b), pPir (c) or MEA 
(d). PSyn, human synapsin 1 promoter. Scale bars, 2mm (left), 400 1m 
(right). e, Plasma ACTH concentration after injection of CNO or saline 
into mice expressing hM3Dq in the pPir, AmPir or MEA. n= 5-6 per 
condition. Error bars indicate s.e.m. *P < 0.05, Mann-Whitney U-test. 
f, PVN neurons co-labelled for Crh (red) and nuclear c-Fos (nc-Fos) 
(green) after injection of saline or CNO into mice expressing hM3Dq in 
the AmPir. Scale bar, 20|1m. g, Percentage of PVN Crh* neurons 
co-labelled for nc-Fos in mice treated as in e. n = 4-6 per condition. Error 
bars indicate s.e.m. *P < 0.05, Mann-Whitney U-test. h, AmPir neurons 
labelled for nArc (green) (arrowheads) after saline or CNO injection of 
mice expressing hM3Dq in the AmPir. Scale bar, 100 1m. i, Number of 
nArct neurons in the AmPir, pPir or MEA in mice treated as ine. n=4 
per condition. Error bars indicate s.e.m. *P < 0.05, unpaired t-test. 


the AmPir of Emx1-Cre mice bilaterally with AAV, AAV-DIO-hM4Di- 
mCherry” (Fig. 4a, b and Extended Data Fig. 8a, b). This virus has 
Cre-dependent expression of hM4Di, which silences neurons upon 
binding to CNO*®. We subsequently injected mice with CNO or saline, 
exposed them to predator odours, and then determined plasma levels 
of stress hormones. 

Silencing of the AmPir dramatically reduced stress hormone 
responses to predator odours (Fig. 4d). Plasma ACTH responses to 
TMT and bobcat urine were decreased by 52 + 11% and 55 +02%, 
respectively (Fig. 4d). These decreases correlated with reductions of 
58+ 03% and 65 + 04% in the percentages of AmPir neurons activated 
by TMT and bobcat urine, as assessed by nArc labelling (Extended 
Data Fig. 9a, b). AmPir silencing also decreased plasma corticosterone 
(Fig. 4e) and the percentage of c-Fos* PVN CRH neurons observed 
after predator odour exposure (Fig. 4f). These results suggested that 
neurons in the AmPir are required for full predator-odour-induced 
increases in stress hormones. 

To examine whether adjacent PLCo/PMCo regions were involved 
in the silencing effects observed for the AmPir, we injected AAV- 
DIO-hM4Di-mCherry selectively into these regions, but not into 
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Figure 4 | Chemogenetic silencing of the AmPir inhibits stress hormone 
responses to predator odours. a-c, mCherry immunostaining (red) after 
injection of AAV-DIO-hM4Di-mCherry (a) into the AmPir (b) or PLCo/ 
PMCo (c). Scale bars, 1 mm (left), 400 1m (right). d, e, Plasma ACTH 

(d) or corticosterone (e) concentration after mice expressing hM4Di 

in the AmPir or PLCo/PMCo were injected with saline or CNO and 

then exposed to bobcat urine or TMT. n= 6-7 per condition. Error bars 
indicate s.e.m. *P < 0.05, Mann-Whitney U-test (d) or unpaired f-test 
(e). f, Percentage of PVN Crh* neurons expressing nc-Fos in mice treated 
as in d, e. n= 5-7 per condition. Error bars indicate s.e.m. *P < 0.05, 
** P< 0.001, unpaired t-test. g, Mice expressing hM4Di in the AmPir 
(d-f) were videotaped during exposure to bobcat urine and videos 

were analysed for the percentage of time spent immobile (percentage 
immobilized). n = 6-7 per condition. Error bars indicate s.e.m. 


the AmPir (Fig. 4c and Extended Data Fig. 8c). In sharp contrast 
to silencing of the AmPir, silencing of these regions did not affect 
predator-odour-induced increases in ACTH and corticosterone or 
CRH neuron activation (Fig. 4d—f and Extended Data Fig. 9c). These 
results clearly indicate that the AmPir, but not adjacent regions of the 
PLCo or PMCo, is involved in predator-odour-induced hormonal fear 
responses. 

Interestingly, AmPir silencing did not affect the ‘freezing behav- 
iour (immobilization) induced by bobcat urine or TMT, a typical fear 
behaviour caused by predator odours (Fig. 4g). A recent study showed 
that inhibition of neurons in the PLCo reduced freezing behaviour 
to TMT'®. Given the effects of AmPir silencing on stress hormone 
responses to predator odours, these findings suggest that different 
OC areas may be involved in the behavioural versus hormonal effects 
of predator odours, with the AmPir and PLCo having major roles in 
their hormonal and behavioural effects, respectively. 

Instinctive fear responses to predator odours are observed in naive 
animals, suggesting the involvement of stereotyped olfactory circuits. 
We have used a combination of monosynaptic and polysynaptic viral 
tracing, indicators of neuronal activation, and chemogenetic activation 
and silencing to investigate how the OC translates volatile predator 
odour signals from the nose into a hormonal fear response. These 
studies implicate only one small area of the OC in this response, the 
AmPir. They reveal that neurons in multiple OC areas can trans- 
mit signals to CRH neurons that control stress hormones, but that 
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fox and bobcat predator odours significantly activate only those in 
the AmPir. Chemogenetic activation of the AmPir induces stress 
hormone increases and its silencing causes a striking decrease in the 
stress hormone response to predator odours. Together, these findings 
suggest that the AmPir plays a key part in generating hormonal fear 
responses to volatile predator cues that signal danger. The functions 
of neurons upstream of CRH neurons in other OC areas are unknown. 
They might also contribute to stress hormone responses. However, 
given that rose oil can block stress hormone responses to a predator 
odour”, it is also conceivable that some transmit signals that suppress 
rather than activate hormonal responses associated with fear. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 16 June 2015; accepted 20 January 2016. 
Published online 21 March 2016. 


1. Apfelbach, R., Blanchard, C. D., Blanchard, R. J., Hayes, R. A. & McGregor, |. S. 
The effects of predator odors in mammalian prey species: a review of field and 
laboratory studies. Neurosci. Biobehav. Rev. 29, 1123-1144 (2005). 

2. Takahashi, L. K. Olfactory systems and neural circuits that modulate predator 
odor fear. Front. Behav. Neurosci. 8, 72 (2014). 

3. Ulrich-Lai, ¥. M. & Herman, J. P. Neural regulation of endocrine and autonomic 
stress responses. Nature Rev. Neurosci. 10, 397-409 (2009). 

4. Kobayakawa, K. et al. Innate versus learned odour processing in the mouse 
olfactory bulb. Nature 450, 503-508 (2007). 

5. Papes, F., Logan, D. W. & Stowers, L. The vomeronasal organ mediates 
interspecies defensive behaviors through detection of protein pheromone 
homologs. Cel! 141, 692-703 (2010). 

6. Buck, L. B. & Bargmann, C. in Principles of Neuroscience (eds Kandel, E., 
Schwartz, J., Jessell, T., Siegelbaum, S. & Hudspeth, A. J.) 712-742 (McGraw-Hill, 
2012). 

7. Neville, KR. & Haberly, L. B. in The Synaptic Organization of the Brain 
(ed. Shepherd, G. M.) 415-454 (Oxford University Press, 2004). 

8. Buck, L. & Axel, R. A novel multigene family may encode odorant receptors: 

a molecular basis for odor recognition. Ce// 65, 175-187 (1991). 

9. Ressler, K. J., Sullivan, S. L. & Buck, L. B. A zonal organization of odorant 
receptor gene expression in the olfactory epithelium. Cell 73, 597-609 (1993). 

10. Vassar, R., Ngai, J. & Axel, R. Spatial segregation of odorant receptor expression 
in the mammalian olfactory epithelium. Cel! 74, 309-318 (1993). 

11. Hanchate, N. K. et al. Single-cell transcriptomics reveals receptor transformations 
during olfactory neurogenesis. Science 350, 1251-1255 (2015). 

12. Ressler, K. J., Sullivan, S. L. & Buck, L. B. Information coding in the olfactory 
system: evidence for a stereotyped and highly organized epitope map in the 
olfactory bulb. Cell 79, 1245-1255 (1994). 

13. Vassar, R. et al. Topographic organization of sensory projections to the olfactory 
bulb. Cell 79, 981-991 (1994). 

14. Mombaerts, P. et a/. Visualizing an olfactory sensory map. Cell 87, 675-686 
(1996). 

15. Sosulski, D. L., Bloom, M. L., Cutforth, T., Axel, R. & Datta, S. R. Distinct 
representations of olfactory information in different cortical centres. Nature 
472, 213-216 (2011). 


106 | NATURE | VOL 532 | 7 APRIL 2016 


16. Miyamichi, K. et al. Cortical representations of olfactory input by trans-synaptic 
tracing. Nature 472, 191-196 (2011). 

17. Ghosh, S. et al. Sensory maps in the olfactory cortex defined by long-range 
viral tracing of single neurons. Nature 472, 217-220 (2011). 

18. Root, C. M., Denny, C. A., Hen, R. & Axel, R. The participation of cortical 
amygdala in innate, odour-driven behaviour. Nature 515, 269-273 
(2014). 

19. Card, J. P. & Enquist, L. W. Transneuronal circuit analysis with pseudorabies 
viruses. Curr. Protoc. Neurosci. Chapter 1, Unit 1.5 (2001). 

20. DeFalco, J. et al. Virus-assisted mapping of neural inputs to a feeding center in 
the hypothalamus. Science 291, 2608-2613 (2001). 

21. Krashes, M. J. et al. An excitatory paraventricular nucleus to AgRP neuron 
circuit that drives hunger. Nature 507, 238-242 (2014). 

22. Sawchenko, P. E. & Swanson, L. W. The organization of forebrain afferents to 
the paraventricular and supraoptic nuclei of the rat. J. Comp. Neurol. 218, 
121-144 (1983). 

23. Guzowski, J. F., McNaughton, B. L., Barnes, C. A. & Worley, P. F. Environment- 
specific expression of the immediate-early gene Arc in hippocampal neuronal 
ensembles. Nature Neurosci. 2, 1120-1124 (1999). 

24. Vernet-Maury, E. in Olfaction and Taste (ed. Van der Starre, H.) 407 (IRL, 1980). 

25. Krashes, M. J. et al. Rapid, reversible activation of AgRP neurons drives feeding 
behavior in mice. J. Clin. Invest. 121, 1424-1428 (2011). 

26. Gorski, J. A. et al. Cortical excitatory neurons and glia, but not GABAergic 
neurons, are produced in the Emx1-expressing lineage. J. Neurosci. 22, 
6309-6314 (2002). 

27. Vong, L. et al. Leptin action on GABAergic neurons prevents obesity 
and reduces inhibitory tone to POMC neurons. Neuron 71, 142-154 
(2011). 

28. Armbruster, B. N., Li, X., Pausch, M. H., Herlitze, S. & Roth, B. L. Evolving the 
lock to fit the key to create a family of G protein-coupled receptors potently 
activated by an inert ligand. Proc. Nat! Acad. Sci. USA 104, 5163-5168 
(2007). 

29. Matsukawa, M., Imada, M., Murakami, T., Aizawa, S. & Sato, T. Rose odor can 
innately counteract predator odor. Brain Res. 1381, 117-123 (2011). 

30. Franklin, K. & Paxinos, G. The Mouse Brain in Stereotaxic Coordinates 3rd edn 
(Academic, 2008). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank L. Enquist for providing the parental PRV, 

PRV TK-BaBlu, and for helpful advice, and members of the Buck laboratory 
for discussions and comments. This work was supported by the Howard 
Hughes Medical Institute (HHMI) and by grants from the National Institutes of 
Health (National Institute on Deafness and Other Communication Disorders) 
(L.B.B.) and the Japan Society for the Promotion of Science (K.K.). L.B.B. is an 
Investigator of the HHMI. 


Author Contributions K.K. and L.B.B. conceived the project, Z.L. and K.K. 
designed and prepared the viruses, X.Y. performed experiments, D.P.O. and 
B.B.L. generated the CRH-Cre mice, K.K. developed methods and performed 
most of the experiments and data analysis, and K.K. and L.B.B. wrote the 
manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the paper. 
Correspondence and requests for materials should be addressed to 

L.B.B. (Ibuck@fhcerc.org). 


© 2016 Macmillan Publishers Limited. All rights reserved 


METHODS 

Mice. CRH-IRES-Cre mice were generated previously”. C57BL/6] wild- 
type mice, Emx1-IRES-Cre and Vglut2-IRES-Cre knock-in mice, and Rosa- 
floxstop-GFP reporter mice were purchased from the Jackson Laboratory. All pro- 
cedures involving mice were approved by the Fred Hutchinson Cancer Research 
Center Institutional Animal Care and Use Committee. 

We used both males and females in all experiments, with similar numbers where 
possible. The same number of each sex was used in experiments shown in Fig. 1 
and Extended Data Figs 2, 3, 5 and 6. 

No statistical methods were used to predetermine sample size. Animals were 

randomly chosen for experimental subjects. Animals were excluded from certain 
experiments. For nArc experiments shown in Fig. 2, animals were excluded if PRV* 
cells were not found in all 5 OC and 3 VA areas analysed. For chemogenetic activa- 
tion or silencing of the AmPir, animals were excluded if more than approximately 
50% of mCherry* cells were observed outside the AmPir. 
Viral vectors. PRVs. PRVB177 and PRVB316 were constructed using homol- 
ogous recombination between targeting vectors and genomic DNA of PRV 
TK-BaBlu, a thymidine kinase (TK)-deleted PRV Bartha strain derivative with 
a LacZ insertion into the gG locus”. For targeting vectors, a flexstop-flanked 
sequence*! encoding a PRV TK fused at its C terminus to a haemagglutinin (HA) 
epitope tag (for PRVB177), or enhanced green fluorescent protein (eGFP) (for 
PRVB316), was first cloned with an inverse orientation into an eGFP-deleted pEG- 
FP-N1 vector (Clontech). Next, Nsil fragments containing a CMV promoter, the 
flexstop-flanked coding sequence, and an $V40 polyadenylation signal were cloned 
between gG locus sequences matching those 5’ and 3’ to the lacZ sequence in 
PRV TK-BaBlu to give the final targeting vectors. These vectors were then lin- 
earized and co-transfected with PRV TK-BaBlu genomic DNA into HEK 293T 
cells (ATCC). Recombinant virus clones were selected and confirmed following 
methods described previously”. 

To propagate recombinant PRVs, PK15 cells (ATCC) were infected with the 
viruses using a multiplicity of infection (m.o.i.) =0.1-0.01. After being infected, 
cells showed a prominent cytopathic effect (~2 days). They were harvested by 
scraping, and the cell material was frozen using liquid nitrogen and then quickly 
thawed in a 37°C water bath. After three freeze-thaw cycles, cell debris was 
removed by centrifugation twice at 1,000g for 5 min and the supernatant was then 
used for experiments. The titre of viral stocks was determined using standard 
plaque assays on PK15 cells”, with titres expressed in plaque-forming units (p.f.u.). 
Lentivirus. To generate LVF2TK, a flexstop-flanked sequence encoding TK-HA 
(see earlier) was cloned into the pLenti6.3 vector (Thermo Fisher). LVF2TK was 
produced using the ViraPower HiPerform Lentiviral Expression System (Thermo 
Fisher) according to the manufacturer’s instructions. Virus was concentrated using 
ultracentrifugation as described previously**. Viral titre was measured using the 
UltraRapid Lentiviral Titer Kit (System Biosciences) and titres were described in 
infectious units (i.f.u). 

AAVs. Serotype 8 AAVs with Cre-recombinase-dependent flexstop cassettes that 
permit expression of mCherry-fused hM3Dq or mCherry-fused hM4Di under 
the control of the human synapsin promoter (AAV-DIO-hM3Dq-mCherry or 
AAV-DIO-hM4Di-mCherry) were purchased from the Vector Core at the 
University of North Carolina at Chapel Hill (the UNC vector core). Amount used 
is described in virus particles (v.p.). 

Stereotaxic injection. Mice aged 2-6 months were used for injection. Viruses 
were injected into the brain using a Stereotaxic Alignment System (David Kopf 
Instruments) with an inhalation anaesthesia of 2.5% isoflurane. Virus suspensions 
(PRVs: 1-1.5 x 10° p.f.u. (1 ul); LVF2TK: 1-1.5 x 10° ifu (1 ul); AAVs: 1-3 x 10° 
v.p. (200-330 nl)) were loaded into a 1-11 syringe, and injected at 100 nl per minute. 
The needle was inserted to the target locations based on a stereotaxic atlas*’. After 
recovery, animals were singly housed with regular 12h dark/light cycles, and food 
and water were provided ad libitum. 

Odour exposure. Mice were exposed to a predator odour or distilled water within 
a modified polycarbonate vacuum-desiccator, as described previously**. A sin- 
gle mouse was placed on a platform in a chamber of 15cm diameter with input 
and output ports, exposed to charcoal-filtered air for 16h, and then to filtered air 
bubbled through 12 mM 2,5-dihydro-2,4,5-trimethylthiazoline (TMT) (Contech) 
diluted in water, water alone, bobcat urine (Maine outdoor solutions or Murray’s 
lures and trapping), or rabbit urine (Kishel’s quality animal scents and lures). For 
detection of nArc in PRV-infected cells, CRH-Cre mice injected with PRVB177 4 
days earlier were exposed to odours for 5 min. For analysis of c-Fos expression in 
CRH neurons, CRH-Cre mice crossed with Rosa-floxstop-GFP mice were exposed 
to odours for 10 min and then to clean air for 50 min. All odour exposures were 
performed between 9:00 and 11:00 a.m. 

Plasma ACTH and corticosterone assays. After mice were killed by cervical dis- 
location and decapitation, trunk blood was collected directly into blood collection 
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tubes (Becton Dickinson) containing 5011 aprotinin (Phoenix Pharmaceuticals). 
Plasma was obtained by centrifugation at 1,600g for 15 min at 4°C, and stored at 
—80°C. Plasma ACTH concentrations were measured using the ACTH ELISA 
kit (MD Biosciences), according to the manufacturer's instructions, with the fol- 
lowing modifications: (1) 1001] of the controls or blood plasma combined with 
100.1 of PBS (pH 7.4) was used in the place of 200,11 plasma; and (2) the results 
were assessed with the QuantaRed Enhanced Chemifluorescent HRP Substrate 
(Thermo Fisher). Plasma corticosterone concentrations were measured using 
the corticosterone ELISA kit (Abcam), according to the manufacturer's instruc- 
tions, with the following modifications: (1) plasma was diluted 25 times instead 
of 100 times with buffer M; and (2) the results were assessed with the QuantaRed 
Enhanced Chemifluorescent HRP Substrate. Fluorescence was measured with a 
CytoFluor4000 plate reader (Applied Biosystems). 

Immunofluorescence. Animals were perfused transcardially with 4% paraform- 
aldehyde (PFA). Their brains were then soaked in 4% PFA for 4h, in 30% sucrose 
for 48h, and then frozen in OCT (Sakura) and cut into 14—20-\1m coronal sections 
using a cryostat. Brain sections were washed twice with PBS, permeabilized with 
0.5% Triton X-100 in PBS for 5 min, washed twice with PBS, blocked with TNB 
(Perkin Elmer) for 1h at room temperature, and then incubated with primary 
antibodies diluted in TNB at 4°C overnight. Sections were then washed three times 
with TNT (0.1 M Tris pH 7.5, 150 mM NaCl, 0.05% Tween), incubated with the 
appropriate secondary antibodies and 0.5 1g ml! 4’,6-diamidino-2-phenylindole 
(DAPI; Sigma) for 1h at room temperature, and washed three times with TNT. 
Slides were coverslipped with Fluoromount-G (Southern Biotech). The following 
antibodies were used: (1) biotinylated mouse anti-HA (BioLegend, #901505, 1:300) 
followed by Alexa488-Streptavidin (Thermo Fisher, #$32354, 1:1,000) for polysyn- 
aptic PRV tracing; (2) goat anti-GFP (Rockland, #600- 101-215, 1:1,000) and bioti- 
nylated anti-HA (BioLegend, 1:300) followed by Alexa488 donkey anti-goat IgG 
(Thermo Fisher, #A11055, 1:1,000) and Alexa555-Streptavidin (Thermo Fisher, 
#532355, 1:1,000) for monosynaptic PRV tracing; (3) rabbit anti-GFP (Thermo 
Fisher, #A-11122, 1:500) and goat anti-c-Fos (Santa Cruz, #sc-52G, 1:300) fol- 
lowed by Alexa488 donkey anti-rabbit IgG (Thermo Fisher, #A21206, 1:1,000) and 
Alexa555 donkey anti-goat IgG (Thermo Fisher, #A21432, 1:1,000) for analysis of 
c-Fos expression in CRH neurons. 

In situ hybridization. In situ hybridization was performed essentially as described 
previously!!9, with some experiments using additional steps for dual staining. 
Coding region fragments of Arc, Vglut1, Vglut2, Gad1, Gad2 and Crh, and the first 
intron sequence of c-Fos mRNA (for nuclear c-Fos staining) were isolated from 
mouse brain cDNA or mouse genomic cDNA using PCR, and cloned into the pCR4 
Topo vector (Thermo Fisher). Digoxigenin (DIG)- or fluorescein (FLU)-labelled 
cRNA probes (riboprobes) were prepared using the DIG or FLU RNA Labelling 
Mix (Roche). Brains were frozen in OCT, and 16-20-|1m coronal cryostat sections 
were hybridized to DIG- and/or FLU-labelled cRNA probes at 56°C for 13-16h. 
nArc mRNA staining. After hybridization, sections were washed twice in 
0.2 x SSC at 63°C for 30 min, incubated with peroxidase (POD)-conjugated anti- 
DIG antibodies (Roche, #11207733910, 1:2,000) at 37°C for 2h, and then treated 
using the TSA-plus FLU kit (Perkin Elmer). Sections were then coverslipped with 
Fluoromount-G with DAPI (Southern Biotech). 

Co-staining for nArc, Vglut1/2 or Gad1/2 mRNA and HA (PRVB177). After hybrid- 
ization, sections were washed twice in 0.2 SSC at 63°C for 30 min, incubated 
with POD-conjugated anti-DIG antibodies (Roche, #11207733910, 1:2,000) and 
biotinylated anti-HA antibodies (BioLegend, #901505, 1:300) at 37°C for 2h, and 
then treated using the TSA-plus Cy3 kit (Perkin Elmer). Sections were then incu- 
bated with 0.5 1g ml"! DAPI and Alexa488-Streptavidin (Thermo Fisher, 1:1,000) 
at room temperature for 1h, and were coverslipped with Fluoromount-G. 
Co-staining with Crh and nuclear c-Fos riboprobes. After hybridization, sections 
were washed twice in 0.2 SSC at 63°C for 30 min, blocked with Streptavidin/ 
Biotin Blocking kit (Vector Laboratories), incubated with POD-conjugated 
anti-FLU antibodies (Roche, #11426346910, 1:300) and alkaline phosphatase 
(AP)-conjugated anti-DIG antibodies (Roche, #11093274910, 1:300) at room tem- 
perature for 2h, and then treated using the TSA-plus Biotin kit (Perkin Elmer). 
Sections were next incubated with 0.5,1gml~' DAPI and Alexa488-Streptavidin 
(Thermo Fisher, 1:1,000) at room temperature for 30 min, incubated with HNPP 
Fluorescent Detection Set (Roche) at room temperature for 1h, and then covers- 
lipped with Fluoromount-G, 

Chemogenetic neuronal activation and silencing. Activation. AAV-DIO-hM3Dq- 
mCherry was injected into the pPir, AmPir or MEA of Emx1-IRES-Cre or Vglut2- 
IRES-Cre mice by stereotaxic injection (see earlier). At >2 weeks after injection, 
mice were intraperitoneally injected with clozapine-N-oxide (CNO; Sigma) 
(5.0mgkg ' body weight) or control saline. Twenty minutes later, trunk blood and 
brain were collected and the blood was used for plasma ACTH or corticosterone 
assays (see earlier). Brains were fixed with 4% PFA for 4h, soaked in 30% sucrose 
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for 24h, frozen in OCT, and cut into 20-|1m coronal sections using a cryostat. 
To analyse expression of hM3Dq-mCherry, brain sections were immunostained 
with rabbit anti-RFP (Rockland, #600-410-379, 1:500) followed by Alexa555 don- 
key anti-rabbit IgG (Thermo Fisher, #431572, 1:1,000) antibodies (see earlier). 
Silencing. AAV-DIO-hM4Di-mCherry was injected into the AmPir of Emx1- 
IRES-Cre mice by stereotaxic injection (see earlier). At >2 weeks after injection, 
mice were intraperitoneally injected with CNO (5.0 mgkg”! body weight) or con- 
trol saline. Forty minutes later, mice were exposed to bobcat urine or TMT for 
10 min (see earlier). Animal behaviour during exposure was recorded with a Canon 
PowerShot ELPH300HS camera placed above the chamber, and the duration that 
mice stayed immobile during the 10-min odour exposure were scored by using 
Ethovision XT11 software (Noldus Information Technology). Trunk blood and 
brain were then collected. Blood was used for plasma ACTH or corticosterone 
assays (see earlier). Brains were treated and immunostained with rabbit-anti-RFP 
antibodies (see earlier). 

Cell counting. Images were collected using an AxioCam camera and 
AxioImager.Z1 microscope with apotome device (Zeiss), or TissueFAXS system 
(Tissuegnostics). Brain structures were identified microscopically and in digital 
photos using a mouse brain atlas*”. 

To analyse the locations and number of PRV-infected cells in brain areas, every 
fifth section was analysed, and numbers of PRV* cells were multiplied by five to 
acquire approximate total number of cells per animal. Brain areas were judged 
to contain upstream neurons if they contained >10 labelled neurons in >50% 
of animals. 

To analyse the location and number of virus-infected and nArc-expressing cells, 
three of five sections were analysed. The number of PRV* cells with or without 
nArc mRNA was blindly counted. The percentages of PRV* cells with only nArc 
signals among all PRV* cells lacking cytoplasmic Arc were calculated. 

To analyse the activation of CRH neurons in chemogenetic experiments, the 
numbers of Crh* neurons labelled for nc-Fos mRNA were blindly counted. 

To analyse the distribution of virus-infected cells, all sections through the 
regions of interest were analysed. The section images were aligned from anterior 
to posterior, and the numbers of virus-infected cells were counted. The size of 
each brain was adjusted linearly so that each posterior piriform cortex had the 
same length (2.8 mm). 

Statistical analysis. All data are shown as the mean + s.e.m. Data were tested 
with the Shapiro-Wilk test for normality. For data with a normal distribution, 
the unpaired t-test or one-way ANOVA with post-hoc Dunnett’s test was used to 
compare two groups or more than two groups, respectively, to analyse statistical 
significance. For t-tests, equality of variances was analysed with the F-test and 
Welch's correction was employed when variances of populations were significantly 
different. For data without a normal distribution, the Mann-Whitney U-test or the 
Kruskal-Wallis test with post-hoc Dunn's test was used to compare two groups or 
more than two groups, respectively. All tests were two-sided. Detailed information 
on the numbers of animals used, statistical analyses, and effect sizes is provided 
in Supplementary Table 1. 

Abbreviations for brain areas. Abbreviations used for brain areas are according 
to Franklin and Paxinos*”, with a few minor modifications. 

Olfactory areas. ACo, anterior cortical amygdala; AON, anterior olfactory nucleus; 
AmPir, amygdalo-piriform transition area; CxA, cortex-amygdala transition 
zone; LEnt, lateral entorhinal cortex; LOT, nucleus of lateral olfactory tract; 
MEAa, medial amygdala, anterior part; MEAp, medial amygdala, posterior part; 


OT, olfactory tubercle; aPir, piriform cortex, anterior part; pPir, piriform cortex, 
posterior part; PLCo, posterolateral cortical amygdala; PMCo, posteromedial cor- 
tical amygdala; TTT, tenia tecta. 

Non-olfactory areas. AA, anterior amygdaloid area; AcbSh, accumbens nucleus, 
shell; AH, anterior hypothalamic area; AHi, amygdalohippocampal area; AI, 
agranular insular cortex; ARC, arcuate hypothalamic nucleus; AVPe, anteroventral 
periventricular nucleus; BLA, basolateral amygdala, anterior part; BLP, basolateral 
amygdaloid nucleus, posterior part; BMA, basomedial amygdala, anterior part; 
BMP, basomedial amygdaloid nucleus, posterior part; BNSTa, bed nucleus of the 
stria terminalis, anterior part; BNSTp, bed nucleus of the stria terminalis, posterior 
part; CA1, field CA1 of the hippocampus; CA3, field CA3 of the hippocampus; 
CEA, central amygdala; CEnt, caudomedial entothinal cortex; CI, caudal intersti- 
tial nucleus of the medial longitudinal; CVL, caudoventrolateral reticular nucleus; 
DEn, dorsal endopiriform claustrum; DI, dysgranular insular cortex; DMH, dor- 
somedial hypothalamic nucleus; DP, dorsal peduncular cortex; DRN, dorsal raphe 
nucleus; Ect, ectorhinal cortex; 7N, facial nucleu; Gi, gigantocellular reticular 
nucleus; HDB, nucleus of the horizontal limb of the diagonal band; I, intercalated 
nuclei of the amygdala; IPAC, interstitial nucleus of the posterior limb of the ant; 
IPC, interpeduncular nucleus, caudal subnucleus; IPR, interpeduncular nucleus, 
rostral subnucleus; IRt, intermediate reticular nucleus; La, lateral amygdala; LA, 
lateroanterior hypothalamic nucleus; LC, locus coeruleus; LDTg, laterodorsal teg- 
mental nucleus; LH, lateral hypothalamic area; LPAG, lateral periaqueductal grey; 
LPGi, lateral paragigantocellular nucleus; LPO, lateral preoptic area; LRt, lateral 
reticular nucleus; LS, lateral septal nucleus; MCLH, magnocellular nucleus of the 
lateral hypothalamus; Md, medullary reticular nucleus; MEnt, medial entorhi- 
nal cortex; MnPO, median preoptic nucleus; MnR, median raphe nucleus; MPA, 
medial preoptic area; MPO, medial preoptic nucleus; MTu, medial tuberal nucleus; 
NLL, nucleus of the lateral lemniscus; NTS, nucleus of the solitary tract; p1PAG, 
pl periaqueductal grey; PaS, parasubiculum; PBN, parabrachial nucleus; PCRt, 
parvicellular reticular nucleus; Pe, periventricular nucleus of the hypothalamus; 
PE, parafascicular thalamic nucleus; PH, posterior hypothalamic nucleus; PLH, 
peduncular part of lateral hypothalamus; PMD, premammillary nucleus, dorsal 
part; PMV, premammillary nucleus, ventral part; PO, paraolivary nucleus; Pr, 
prepositus nucleus; PRh, perirhinal cortex; PSTh, parasubthalamic nucleus; PVN, 
paraventricular nucleus of the hypothalamus; RCh, retrochiasmatic area; RM, 
retromammillary nucleus; SCh, suprachiasmatic nucleus; SHy, septohippocam- 
pal nucleus; SNC, substantia nigra; SO, supraoptic nucleus; StHy, striohypotha- 
lamic nucleus; STIA, bed nucleus of the stria terminalis, intraamygdaloid; STr, 
subiculum, transition area; SubC, subcoeruleus nucleus; VDB, vertical limb of 
the diagonal band; VLPO, ventrolateral preoptic nucleus; VMH, ventromedial 
hypothalamic nucleus; VP, ventral pallidum; VS, ventral subiculum; ZI, zona 
incerta. 


31. Schnitgen, F. et a/. A directional strategy for monitoring Cre-mediated 
recombination at the cellular level in the mouse. Nature Biotechnol. 21, 
562-565 (2003). 

32. Banfield, B. W. & Bird, G. A. Construction and analysis of alphaherpesviruses 
expressing green fluorescent protein. Methods Mol. Biol. 515, 227-238 (2009). 

33. Tiscornia, G., Singer, O. & Verma, |. M. Production and purification of lentiviral 
vectors. Nature Protocols 1, 241-245 (2006). 

34. Boehm, U., Zou, Z. & Buck, L. B. Feedback loops link odor and pheromone 
signaling with reproduction. Ce// 123, 683-695 (2005). 

35. Liberles, S. D. & Buck, L. B. A second class of chemosensory receptors in the 
olfactory epithelium. Nature 442, 645-650 (2006). 


© 2016 Macmillan Publishers Limited. All rights reserved 


a 
b 
PRVB316 LVF2TK 
ff 
c 7 d 
10 : 
GH Inside PVN 
M8 Outside PVN 

x) 

x0) 2 
(S) [3 
+ rs) 
> + 
w > 
= o 
[e) = 
is [e) 
oO 
# 8 
: z 
2 Zz 


PRVB316 PRVB316 PRVB316 


+ + 
LVF2TK LVF2TK 
CRH-Cre WT 


Extended Data Figure 1 | Virus infection and spread in CRH-Cre and 
wild-type mice. a, Photographs show a PVN section from CRH-Cre mice 
crossed with Rosa-floxstop-GFP mice that was co-stained with anti-GFP 
antibodies (green) (Cre) and a Crh riboprobe (red) (Crh). Co-labelled cells 
(yellow) can be seen in the merged image, verifying the expression of Cre 
in CRH neurons. Of GFP* (Cre*) neurons in PVN sections, 93.9 +0.5% 
expressed Crh. n= 4. Scale bar, 100m. b, Photographs show a section 
through the PVN of a CRH-Cre mouse previously injected with PRVB316 
and LVE2TK. The section was co-stained with antibodies for Cre- 
dependent reporters of PRVB316 (GFP (green)) and LVF2TK 

(HA (red)). Some co-labelled neurons are seen (arrows), verifying 
co-infection of some neurons by the two viruses. Scale bar, 200 1m. c, The 
number of PRVB316" cells inside or outside the PVN, 4 days after PVN 
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injection of PRVB316 alone (m= 2) or PRVB316+LVF2TK (n= 24) in 
CRH-Cre mice or PRVB316+LVF2TK in wild-type (WT) mice (n= 3). 
Error bars indicate s.e.m. In CRH-Cre mice, PRVB316* cells were seen 
both inside and outside the PVN after co-injection of PRVB316 and 
LVF2TK, but only inside the PVN when only PRVB316 was injected. In 
wild-type mice co-injected with the two viruses, PRV* cells were not 
detected either inside or outside the PVN. d, The number of PRV* cells 
inside or outside the PVN 3 or 5 days after PVN injection of PRVB177 in 
CRH-Cre or wild-type mice. n = 8 (CRH-Cre mice, day 3), n = 6 (CRH- 
Cre mice, day 5), n= 2 (wild-type mice, day 3), n = 2 (wild-type mice, 
day 5). Error bars indicate s.e.m. PRVB177* neurons were seen both 
inside and outside the PVN of CRH-Cre mice on both days, but were not 
detected in wild-type mice at either location. 
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Extended Data Figure 2 | Brain areas with neurons upstream of 

CRH neurons. The percentage of all non-PVN PRV* neurons found in 
individual brain areas 4 days after PVN injection of the monosynaptic 
PRV (PRVB316; Mono) (n= 24) or on day 3, 4 or 5 after PVN injection of 
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the polysynaptic PRV (PRVB177; Poly) (n= 8, 6 and 6, for day 3, 4 and 5, 
respectively). Error bars indicate s.e.m. See Methods for full names of 
abbreviated brain areas. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Mono 


Poly 


Olfactory cortex 
> 
fe) 
fo} 


Day3 | Day4 | Day5 


Vomeronasall| 
amygdala 
= 
m 
> 

no] 


Amygdala 
fey] 
i 
> 


DP 
AV/DI/GI 
Ect 
PRh 
CEnt 
MEnt 
DEn 


Cortex 


10 100 250 


Extended Data Figure 3 | Brain areas containing neurons upstream 
of CRH neurons. Coloured boxes indicate the approximate number of 
infected neurons seen in individual brain areas (white indicates none) 
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PRV (PRVB316; Mono) (n= 24) or on day 3, 4 or 5 after injection with 
the polysynaptic PRV (PRVB177; Poly) (n =8, 6, and 6, for day 3, 4 and 5, 
respectively). See Methods for full names of abbreviated brain areas. 
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Extended Data Figure 4 | Biased distribution of neurons upstream 

of CRH neurons in the pPir, AmPir and MEA. a, b, Diagrams show 

the numbers of PRVB177-infected (HA*) neurons on day 4 after PVN 
injection in sequential 20-{1m coronal sections along the anterior 
(A)-posterior (P) axis in different areas of the OC (a) and VA (b). Data are 
shown in separate panels for four animals for each area. Infected neurons 
appear to be randomly distributed along the anterior—posterior axis in 
some areas, but not others. c-e, Photographs of sections immunostained 
for PRVB177 (HA, green) on day 4 after PVN injection of PRVB177. 
Sections were counterstained with 4’,6-diamidino-2-phenylindole (DAPI). 
More PRVB177* neurons are seen in the posterior than anterior pPir (c) 
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and AmPir (d). In the MEA (e), more are seen in the posteroventral (pv) 
quadrant than other quadrants (posterodorsal (pd), anterodorsal (ad), and 
anteroventral (av)). Scale bars, 200 1m. f-h, The percentages of PRVB177- 
infected neurons in the anterior versus posterior halves of the pPir (f) and 
AmPir (g) and in different quadrants of the MEA (h). n= 4. Error bars 
indicate s.e.m. *P < 0.05, **P< 0.01, ***P< 0.001. Paired t-test (f, g) and 
paired one-way analysis of variance (ANOVA) with post-hoc Tukey’s test 
(h). Infected neurons were concentrated in the posterior halves of the pPir 
and AmPir and were more abundant in the posteroventral quadrant of the 
MEA than in other quadrants. 


Limited. All rights reserved 


LETTER 


a d 
C3 Voglut1/2+ Co Gad1/2+ 
604° eo ‘ ° 
e 
e e 
< e 
pPir a 40 ; * 
oO e @ 
+ e 
& é 
oa e 
‘S 204 
3S e | 
e 
0 
‘ fe) eo h[6F¢ ‘ © re) 
2825 255 8 
a “- €¢€ FS FS & 


Extended Data Figure 5 | Expression of glutamatergic and GABAergic only the MEAp contained many co-labelled for Gad1/2. Scale bars, 100 1m. 
markers in PRV-infected neurons. a—c, Photographs of sections d, The percentage of PRVB177* neurons co-labelled for glutamatergic 
through the pPir (a), AmPir (b) or MEAp (c) 4 days after PVN injection (Vglut1/2*) or GABAergic (Gad1/2*) markers in different areas of the 

of PRVB177. Sections were co-stained with antibodies against HA OC and VA. PRV* neurons unlabelled by either probe could express other 
(PRVB177) (green) and riboprobes (red) for Vglut1 and Vglut2 (Vglut1/2),  neurotransmitters or neuromodulators or reflect PRV effects on RNA 

to identify glutamatergic neurons (Glu), or Gad1 and Gad2 (Gad1/2), transcription or degradation. n = 4 mice per condition. Error bars indicate 
to identify GABAergic neurons (GABA). Sections were counterstained s.e.m. Most of the co-labelled PRV™ cells in the olfactory areas were 

with DAPI. Arrowheads indicate co-labelled neurons. Numerous co-labelled for the glutamatergic makers, but the MEA also contained 
PRV-infected neurons were co-labelled for Vglut1/2 in all three areas, but some co-labelled for the GABAergic markers. 
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Extended Data Figure 6 | TMT and bobcat urine activate CRH 
neurons and induce stress hormone increases. a, Photographs of PVN 
sections from CRH-Cre mice crossed with Rosa-floxstop-GFP mice. 
Mice were exposed to water (HO), rabbit urine, bobcat urine or TMT 
and sections were then co-stained with antibodies against GFP (green), 
to identify CRH neurons, and c-Fos (red), to detect activated neurons. 
Higher magnifications of boxed areas are shown on the right. Many CRH 
neurons (GFP*) expressed c-Fos after exposure to bobcat urine or TMT, 
but not rabbit urine. Scale bars, 100 1m (left) and 20 1m (right). b, The 
percentage of PVN CRH (GFP*) neurons immunostained for c-Fos after 
odour exposure of CRH-Cre mice crossed with Rosa-floxstop-GFP mice. 


Exposure to bobcat urine or TMT, but not rabbit urine, increased the 
percentage of CRH neurons expressing c-Fos. n = 6 mice per condition. 
Error bars indicate s.e.m. ***P < 0.001, one-way ANOVA with post-hoc 
Dunnett’s test. c, d, Plasma ACTH (c) or corticosterone (d) concentrations 
were measured in wild-type mice after odour exposure. Exposure of 
animals to TMT or bobcat urine increased blood levels of ACTH and 
corticosterone, whereas exposure to rabbit urine did not. n =6 mice per 
condition. Error bars indicate s.e.m. *P < 0.05, **P< 0.01, ***P< 0.001, 
one-way ANOVA with post-hoc Dunnett’s test (c) or Kruskal-Wallis test 
with post-hoc Dunn’ test (d). 
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Extended Data Figure 7 | Locations of infected neurons after injection immunostained sections. The target area (AmPir, pPir or MEA) is 

of AAV-DIO-hM3Dq-mCherry. a-c, AAV-DIO-hM3Dq-mCherry indicated in purple for each section. Regions that contained the nuclei of 
was injected into the AmPir (a), pPir (b) or MEA (c) and brain sections mCherry* neurons are indicated in red. Schematic sections are shown 
immunostained for mCherry to detect infected neurons. Shown are for six mice per area that were injected with CNO to test the effects of 
schematic sections from a mouse brain atlas*’ that correspond to activating the area. 
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Extended Data Figure 8 | Locations of infected neurons after injection (a-c), and the location of the PLCo/PMCo is indicated in green (c). 


of AAV-DIO-hM4Di-mCherry. a~c, AAV-DIO-hM4Di-mCherry was Regions that contained the nuclei of mCherry* neurons are indicated 
injected into the AmPir (a, b) or PLCo/PMCo (c) and brain sections in red in each section. Schematics are shown for six mice per area (per 
immunostained for mCherry to detect infected neurons. Shown are condition) injected with CNO and then exposed to bobcat urine (a, c) 
schematic sections from a mouse brain atlas* that correspond to or TMT (b) to test the effects of silencing the AmPir or PLCo/PMCo. 


immunostained sections. The location of the AmPir is indicated in purple 
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Extended Data Figure 9 | Effect of neuronal silencing on the activation 
of neurons by predator odours. a~c, AAV-DIO-hM4Di-mCherry was 
injected into the AmPir (a, b) or PLCo/PMCo (c). Animals were later 
injected with CNO or saline and exposed to bobcat urine (a, c) or TMT 
(b). Brain sections were hybridized with an Arc riboprobe and the number 
of cells labelled for nArc was determined for the AmPir, PLCo and PMCo. 
Histograms show the number of cells with nArc in each area after injection 
with CNO or saline and exposure to predator odour. On the basis of 


the numbers of nArc* neurons, silencing of AmPir neurons caused a 
significant decrease in AmPir and PLCo neurons activated by bobcat urine 
(a) and by AmPir (but not PLCo) neurons activated by TMT (b). Silencing 
of the PLCo/PMCo (c) caused a significant decrease in neurons activated 
by bobcat urine in the PLCo, but not the AmPir or PMCo. n = 6-7 per 
condition. Error bars indicate s.e.m. *P < 0.05, **P < 0.01, ***P < 0.001, 


unpaired t-test. 
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Derivation and differentiation of haploid human 


embryonic stem cells 


Ido Sagi!, Gloryn Chia, Tamar Golan-Lev!, Mordecai Peretz!, Uri Weissbein!, Lina Sui’, Mark V. Sauer?, Ofra Yanuka!, 


Dieter Egli?+ & Nissim Benvenisty! 


Diploidy is a fundamental genetic feature in mammals, in which 
haploid cells normally arise only as post-meiotic germ cells that 
serve to ensure a diploid genome upon fertilization. Gamete 
manipulation has yielded haploid embryonic stem (ES) cells from 
several mammalian species!~°, but haploid human ES cells have 
yet to be reported. Here we generated and analysed a collection 
of human parthenogenetic ES cell lines originating from haploid 
oocytes, leading to the successful isolation and maintenance 
of human ES cell lines with a normal haploid karyotype. 
Haploid human ES cells exhibited typical pluripotent stem cell 
characteristics, such as self-renewal capacity and a pluripotency- 
specific molecular signature. Moreover, we demonstrated the utility 
of these cells as a platform for loss-of-function genetic screening. 
Although haploid human ES cells resembled their diploid 
counterparts, they also displayed distinct properties including 
differential regulation of X chromosome inactivation and of 
genes involved in oxidative phosphorylation, alongside reduction 
in absolute gene expression levels and cell size. Surprisingly, we 
found that a haploid human genome is compatible not only with 
the undifferentiated pluripotent state, but also with differentiated 
somatic fates representing all three embryonic germ layers both 
in vitro and in vivo, despite a persistent dosage imbalance between 
the autosomes and X chromosome. We expect that haploid human 
ES cells will provide novel means for studying human functional 
genomics and development. 

Haploid genetics is useful for delineating genome function. Although 
haploid genetics has been mostly restricted to unicellular organisms, 
recent reports of haploid ES cells have extended it into animal species, 
including mammals’’. Deriving haploid human ES cells by similar 
approaches has probably been hindered by the limited availability of 
human oocytes®. Artificial activation of unfertilized metaphase II (MII) 
human oocytes results in efficient development to the blastocyst stage 
and subsequent derivation of parthenogenetic ES (pES) cells?-'!. In 
mouse parthenogenetic embryos haploidy usually persists at the blas- 
tocyst stage’*!, but diploid cells progressively dominate over increas- 
ing cell cycles due to spontaneous and irreversible diploidization'*"° 
(Fig. 1a). By estimation, even if diploidization occurs in 1 out of 10 cell 
cycles, 1% of ES cells may remain haploid at early passages (Extended 
Data Fig. 1a). 

To explore the feasibility of deriving haploid human ES cells, we gen- 
erated and analysed a collection of 14 early-passage human pES cell 
lines for the persistence of haploid cells. We initially used chromosome 
counting by metaphase spreading as a method for unambiguous and 
quantitative discovery of rare haploid nuclei. Among ten individual 
pES cell lines, a low proportion of haploid metaphases was found in 
one cell line, pES10 (1.3%, Extended Data Table 1). In four additional 
lines, we also used viable fluorescence-activated cell sorting (FACS) 
with Hoechst 33342 staining, aiming to isolate cells with a DNA content 


corresponding to less than two chromosomal copies (2c), leading to the 
successful enrichment of haploid cells from a second cell line, pES12 
(Extended Data Table 2). 
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Figure 1 | Derivation of haploid human ES cells. a, Schematic of 
putative haploidy in pES cells. b, DNA content profiles of haploid 

pES10, established by repeated enrichment of 1c cells. Top to bottom, 
unsorted diploid cells, partially purified (fourth sort) and mostly purified 
(sixth sort) haploid cells. c, pES10 karyotypes before and after 1c-cell 
enrichment. p, passage. d, e, DNA FISH (d) and centromere staining (e) 
in haploid-enriched pES10 cells. Magnifications show haploid nuclei with 
a single hybridization signal (d) and 23 centromere foci (e), respectively. 
Scale bars, 101m. 
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Figure 2 | Haploid human ES cells display pluripotent stem cell 
characteristics and enable loss-of-function screening. a, b, Colony 
morphology and alkaline phosphatase staining. Scale bars, 501m. 

c, Flow cytometry analysis of gated h-pES10 1c cells by co-staining DNA 
and cell surface markers TRA-1-60 and CLDN6. d, Expression levels of 
pluripotency genes in Gl-sorted haploid and diploid ES cells (n= 4 each, 


Two individual haploid-enriched ES cell lines were established from 
pES10 and pES12 (hereafter referred to as h-pES10 and h-pES12) 
within five to six rounds of 1c-cell enrichment and expansion (Fig. 1b 
and Extended Data Fig. 1b). These cell lines were cultured in standard 
conditions for over 30 passages while including cells with a normal 
haploid karyotype (Fig. 1c and Extended Data Fig. 1c). However, 
since diploidization occurred at a rate of 3-9% cells per day (Extended 
Data Fig. 1a; see Methods), sorting at every three to four passages was 
required to maintain haploid cells. Visualization of ploidy was further 
enabled by DNA fluorescence in situ hybridization (FISH) (Fig. 1d 
and Extended Data Fig. 1d) and quantification of centromere protein 
foci (Fig. le and Extended Data Fig. le; see Supplementary Notes and 
Extended Data Fig. 2). Besides having an intact karyotype, haploid ES 
cells did not harbour significant copy number variations (CNVs) rela- 
tive to their unsorted diploid counterparts (Extended Data Fig. 1f), nor 
common duplications that would result in pseudo-diploidy, indicating 
that genome integrity was preserved throughout haploid-cell isolation 
and maintenance. 

Both h-pES10 and h-pES12 exhibited classical human pluripotent 
stem cell features, including typical colony morphology and alkaline 
phosphatase activity (Fig. 2a, b). Single haploid ES cells expressed 
hallmark pluripotency markers as confirmed by centromere foci 
quantification in essentially pure (>95%) haploid cultures (Extended 
Data Fig. 3). Selective flow cytometry enabled the validation of 
the expression of two human ES-cell-specific cell surface markers 
(TRA-1-60 and CLDN6 (ref. 16)) in single haploid cells (Fig. 2c). 
Moreover, sorted haploid and diploid ES cells showed highly similar 
transcriptional and epigenetic signatures of pluripotency genes 
(Fig. 2d, e). In accordance with a parthenogenetic origin, these cells 
also featured distinct molecular profiles of maternal imprinting 
(Extended Data Fig. 4a-d). 
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two biological replicates per line, logarithmic scale). Mean indicated 

by bars. e, DNA methylation levels at pluripotency genes in biological 
duplicates of G1-sorted ES cells and control fibroblasts (Fib). f, Schematic 
of gene trapping and screening for 6-TG-resistance genes. g, NUDT5 
insertions (red arrows) detected in three 6-TG-resistant colonies 
(indicated by TTAA). h, Metabolic pathway leading to 6-TG toxicity. 


Haploid cells are valuable for genetic screening because phenotyp- 
ically selectable mutants can be identified upon disruption of single- 
copy alleles. To demonstrate the applicability of this principle in 
haploid human ES cells, we generated a genome-wide mutant library 
using a gene trap transposon system (Fig. 2f and Extended Data Fig. 4e; 
see Methods), and screened for resistance to the purine analogue 
6-thioguanine (6-TG). Out of six isolated and analysed 6-TG-resistant 
colonies, three harboured an identical gene trap insertion at the NUDT5 
autosomal gene (Fig. 2g). The disruption of this gene was recently 
confirmed to confer 6-TG resistance in human cells!” (Fig. 2h). 
Detection of a loss-of-function phenotype due to an autosomal muta- 
tion thereby validates that genetic screening is feasible in haploid human 
ES cells. 

The ability of human ES cells to exist as both haploids and diploids 
led us to investigate whether these two ploidy states differ in certain 
aspects of gene regulation and cell biology. To analyse haploid and 
diploid ES cells in the same cell cycle phase, we used FACS to iso- 
late G1-phase haploid cells (1c) and compared them with isogenic 
G1-phase diploid cells (2c) from unsorted diploid cultures (Fig. 3a and 
Extended Data Fig. 5a; see Methods). We first aimed to uncover puta- 
tive ploidy-associated differences by comparing the transcriptomes 
of haploid and diploid ES cells using RNA sequencing (RNA-seq), 
considering that observed changes in expression levels would be 
relative to the total gene expression of each ploidy state, rather than 
representing absolute differences. On the genome-scale, undifferen- 
tiated haploid and diploid ES cells clustered closely and separately 
from differentiated embryoid bodies (EBs) (Fig. 3b). Nonetheless, 
we identified 275 relatively upregulated and 290 relatively down- 
regulated genes in haploids compared with diploids (greater than 
twofold change, false discovery rate (FDR) <0.05; Extended Data 
Fig. 5b). 
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Figure 3 | Molecular and cellular comparisons of haploid and diploid 
ES cells. a, Experimental scheme. b, RNA-seq-based clustering analyses 
of isogenic G1-sorted haploid and diploid cells (two biological replicates 
per line) and diploid pES12-derived EBs. c, Pie chart of relatively 
downregulated and upregulated genes in haploid versus diploid ES cells. 
d-g, Differential X chromosome inactivation in haploid and diploid ES 
cells. d, Genome-wide expression moving median plot. e, XIST expression 
levels. f, H3K27me3 staining. Scale bar, 101m. g, X chromosome DNA 


Notably, X chromosomal genes were significantly overrepresented 
among the relatively upregulated gene set (40%, P< 0.001, * good- 
ness of fit test) (Fig. 3c), and the expression levels of X chromosomal 
genes alone clearly distinguished between haploid and diploid ES cells 
(Fig. 3b). These data correlate with an expected differential status of 
X chromosome inactivation in haploid and diploid human ES cells: 
whereas the single-copy X chromosome in haploids is transcriptionally 
active (X,), one of the two X chromosomes in diploids is often inac- 
tivated (X,X;)!*. Indeed, haploid human ES cells exhibited a relative 
increase in X chromosomal gene expression and lacked expression of 
the XIST transcript which drives X chromosome inactivation (Fig. 3d, e 
and Extended Data Fig. 5b-d), as in diploid X,X, human ES cells!”. 
X chromosome inactivation is regulated by repressive histone mod- 
ifications and DNA methylation. H3K27me3 foci were consistently 
observed in unsorted diploid ES cells, but not in the haploid-enriched 
counterparts (Fig. 3f). Moreover, methylome analysis showed that the 
X chromosome DNA methylation signature of haploid ES cells resembles 
that of diploid male ES cells (X,Y), whose single-copy X chromosome 
is largely hypomethylated, as opposed to the composite pattern of a 
hypomethylated X, and a hypermethylated X; in diploid female cells 


methylation levels. h, Relative total RNA, cell volume and ratio of 
mitochondrial to genomic DNA (mtDNA/gDNA) between G1-sorted 
haploid and diploid ES cells. Number of replicates indicated in parenthesis. 
Error bars represent s.d. i, Mean expression levels + s.e.m. of nuclear and 
mitochondrial oxidative phosphorylation genes in haploid and diploid 

ES cells (replicates as in Fig. 2d), and their schematic organization in 

this pathway. *P < 0.05; **P < 0.01 (two-tailed unpaired Student's f test). 
Source data for h and i are available online. 


(Fig. 3g). Interestingly, recently diploidized ES cells (see Methods) 
remained X,X, soon after diploidization by all assays mentioned earlier 
(Fig. 3a, d-g). 

Normalization to total gene expression”? resulted in seemingly 
similar expression levels of autosomal genes, but higher levels of 
X-linked genes in haploid cells (Fig. 3d and Extended Data Fig. 5c). 
However, assuming that the absolute expression of X-linked genes in 
haploid X, and diploid X,X; cells are equivalent, these data suggest a 
genome-wide autosomal gene expression level reduction in haploids 
(Extended Data Fig. 5e, f). In support of this, total RNA amounts 
isolated from haploid ES cells were significantly lower than those from 
equal numbers of diploid cells (Fig. 3h). A decrease in total gene expres- 
sion implied that physical dimensions may also be altered. Indeed, 
the average haploid:diploid diameter ratio of G1-sorted ES cells was 
~0.8, corresponding to ~0.7 surface area ratio and ~0.6 volume ratio 
(Fig. 3h and Extended Data Fig. 5g). 

We subsequently focused on consistent differential regulation 
within autosomes (see Methods). By transcriptome and methylome 
analyses, genes relatively downregulated in haploid ES cells were 
significantly enriched for genes encoding signal-peptide-bearing 
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proteins (Extended Data Fig. 5h). Remarkably, we also detected 
subtle yet significant relative upregulation of 11 genes involved in 
oxidative phosphorylation in haploid cells, including representa- 
tives encoding subunits of four out of the five complexes comprising 
this pathway (Fig. 3i and Extended Data Fig. 5i). Furthermore, all 
13 mitochondrially encoded oxidative phosphorylation genes were 
consistently upregulated as well (Fig. 3i), indicating coordinated 
regulation between nuclear and mitochondrial genes. This coincided 
with a 32% increase in the mitochondrial DNA (mtDNA) to nuclear 
DNA ratio between haploids and diploids (Fig. 3h), suggesting 
that mitochondrial abundance relative to nuclear DNA content is 
higher in haploid cells. 

We next sought to assess the differentiation potential of haploid 
human ES cells. The 21-day-old EBs generated by spontaneous dif- 
ferentiation of haploid-enriched and diploid ES cells could not be 
distinguished by their appearance (Fig. 4a), and the morphology of 
dissociated haploid-cell-derived EB cells was consistent with differ- 
entiation (Extended Data Fig. 6a). Notably, metaphase spreading 
revealed a haploid karyotype (Fig. 4b; 4/4 metaphases), and a largely 
haploid (~70%) DNA profile was confirmed by flow cytometry in both 
h-pES10-derived and h-pES12-derived EB cells (Fig. 4c and Extended 
Data Fig. 6b). We then compared the gene expression profiles of 
G1-sorted haploid ES cells and EB cells, focusing on 18 lineage-specific 
genes across nine cell types (Fig. 4d). Whereas expression levels were 
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negligible in undifferentiated ES cells, all tissue-specific genes were 
expressed in haploid and diploid EB cells (Fig. 4d and Extended Data 
Fig. 6c). Haploid and diploid EB cells showed insignificant expression 
of pluripotency-specific genes, consistent with efficient acquisition of 
differentiated cell fates. 

To extend this analysis to more specific cell types, we subjected 
haploid ES cells to directed differentiation assays. Haploid ES cells 
differentiated towards a neural fate for ten days remained haploid, 
while giving rise to NCAM1-positive neural progenitor cells (NPCs, 
with ~90% efficiency) (Extended Data Fig. 7a, b). Sorted haploid 
NPCs expressed multiple neural-lineage-specific genes but not pluri- 
potency-specific genes (Extended Data Fig. 7c, d). X chromosome 
inactivation in differentiated diploid female cells results in dosage 
compensation of 1:2 between the X chromosome and autosomes. As 
haploid cells do not inactivate their single-copy X chromosome, an 
X:autosomes dosage imbalance of 1:1 should persist into the differ- 
entiated state. Indeed, both haploid NPCs and EB cells showed an X, 
signature contrary to the X,Xj signature of diploid EB cells as indicated 
by whole-genome expression analysis and XIST levels (Fig. 4e and 
Extended Data Fig. 7e). 

Neuronal differentiation was not restricted to the progenitor 
stage as the cells also differentiated with high efficiency (>90%) 
into mature TUJ1 (also known as 6-tubulin III)-positive neurons by 
20 days, with persistence of haploid cells as shown by co-staining 
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with centromeres (Fig. 4f; 47% haploids, n = 104) and FISH analysis 
(Extended Data Fig. 7f, g; 46% haploids, n = 200). Similarly, haploid 
cells differentiated into TNNT2-expressing cardiomyocytes (Fig. 4g; 
32% haploids, n = 97) during an eleven-day protocol resulting in 
spontaneously beating clusters (Supplementary Video 1), and 39% 
(n=31) of haploid cells sorted from the whole culture (25% 1c-cells) 
were confirmed as TNNT2-positive (Fig. 4h and Extended Data 
Fig. 7h). Next, we differentiated haploid-enriched cultures (~70% 
haploids) to the pancreatic lineage. Analysing two stages of 
differentiation by centromere foci analysis, we observed robust 
differentiation (>90%) of both haploids and diploids into FOX A2- 
positive definitive endoderm cells (Fig. 4i; 56% haploids, n= 112), 
and differentiation into PDX1-positive pancreatic cells (Fig. 4j; 13% 
haploid, n = 103), some of which were also positive for NKX6-1. 
In addition, the persistence of haploid PDX1-positive cells was 
confirmed by flow cytometry (Fig. 4k; 10% PDX1-positive 1c cells; 
and Extended Data Fig. 7i, j). 

Finally, both haploid-enriched human ES cell lines gave rise to tera- 
tomas comprising cell types of ectodermal, mesodermal and endoder- 
mal origins (Fig. 4l and Extended Data Fig. 8a, b), meeting the most 
stringent criterion for human pluripotency in vivo. Importantly, no 
residual undifferentiated OCT4-positive cells were detected (Fig. 41 
and Extended Data Fig. 8b). Upon dissection, DNA content analysis 
revealed that a considerable population of h-pES10-derived teratoma 
cells remained haploid (Fig. 4m). Combined analysis of serial sections 
from an independent, h-pES12-derived teratoma, by histology and 
FISH confirmed the existence of in vivo differentiated haploid human 
cells capable of contributing to an organized tissue structure while 
responding to developmental signals (Fig. 4n). Haploid cells were iden- 
tified in all analysed teratomas (n= 4), albeit with variable proportions, 
which may be influenced by the initial amount of haploid cells and/or 
the duration of teratoma formation. 

Haploid mammalian cells have proven invaluable for loss-of-function 
screens’. Using a genome-wide library of gene-trapped haploid human 
ES cells, we demonstrated their potential for biomedically relevant 
functional genomics by forward genetic screening (Supplementary 
Discussion). Whereas previous studies on non-human haploid ES 
cells mostly emphasized the similarity between haploids and diploids, 
here we also pointed to several transcriptional, epigenetic and phys- 
ical properties that set them apart (Supplementary Discussion). 
Interestingly, we did not observe global transcriptional compen- 
sation in haploid cells, indicating that it is not required for cellular 
viability as long as an autosomal balance is preserved. In contrast, 
autosomal imbalance appears intolerable based on the strict absence 
of human autosomal monosomies in in vitro fertilization ES cells”’. 
Remarkably, we found that a haploid human karyotype is not a bar- 
rier for ES cell differentiation. As observed in the mouse’, haploid 
human ES cells gave rise to NPCs while remaining haploid. However, 
while mouse studies showed that haploid cells are lost upon further 
differentiation?"4, we observed specification of human haploid cells 
into somatic cell fates of all three embryonic germ layers, despite per- 
sistent dosage imbalance between the X chromosome and autosomes 
(Supplementary Discussion). 

Throughout evolution, mammalian genomes have been solidi- 
fied by diploidy-dependent adaptations such as parental imprinting, 
which restrict the development of haploid uniparental embryos. 
Nonetheless, haploid cells are capable of directing development in 
certain animal species’. The surprising differentiation potential 
of haploid human genomes suggests that diploidy-dependent 
adaptations, rather than haploidy, pose the predominant barriers for 
uniparental development in humans. The discovery of haploid human 
ES cells should thus provide novel means to delineate basic aspects of 
human genetics and development. 

Online Content Methods, along with any additional Extended Data display items and 


Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The investigators were not blinded to allocation during experiments and outcome 
assessment. 

Human oocyte manipulation and parthenogenetic ES cell line derivation. 
Human oocyte donation and pES and swaP$S cell line derivation procedures were 
described previously!!”?. Oocyte donors gave informed consent. Experiments 
were approved by the embryonic stem cell research oversight committee and the 
institutional review board at Columbia University Medical Center. Briefly, mature 
MII oocytes were activated using a calcium ionophore and/or an electrical pulse, 
followed by 4h of culture with puromycin. Polar body extrusion and the presence 
of a single pronucleus indicating haploidy were confirmed, and oocytes were 
allowed to develop to the blastocyst stage. swaPS cells were derived following acti- 
vation of an oocyte whose nuclear genome had been swapped with that of another 
oocyte!!. ES cell lines were derived by laser ablation of the trophectoderm™! and 
addition of ROCK inhibitor Y-27632 at 10|.M to the derivation medium”. Then 
2-3 days after plating, remaining trophectoderm cells were laser ablated, and 
inner cell mass cells were allowed to grow for 10-14 days until manual picking 
of the outgrowth was feasible. 

Cell culture. Unless otherwise stated, human ES cells were cultured on a feeder 
layer of growth-arrested mouse embryonic fibroblasts (MEFs) in standard human 
ES cell medium composed of Knockout Dulbecco’s Modified Eagle's Medium 
supplemented with 15% Knockout Serum Replacement (KSR, Thermo Fisher 
Scientific), 2mM L-glutamine, 0.1 mM nonessential amino acids, 50 units ml"! 
penicillin, 50j1g ml~! streptomycin, 0.1 mM (3-mercaptoethanol and 8ng ml"! 
basic fibroblast growth factor (bFGF). Cells were free of mycoplasma and main- 
tained in a humidified incubator at 37°C and 5% COp. Passaging was carried out 
either mechanically with gentle trypsinization using trypsin solution A without 
EDTA (Biological Industries), or enzymatically using TrypLE Express (Thermo 
Fisher Scientific) with addition of 101M ROCK inhibitor Y-27632 (Stemgent) for 
1 day after splitting. Haploid ES cells could also be grown in feeder-free conditions 
on Matrigel-coated plates (Corning) in mTeSR1 (STEMCELL Technologies) or 
StemFitN.AK03 (Ajinomoto) media. 

Isolation and maintenance of haploid human ES cell lines. Following iden- 
tification of haploid cells in human parthenogenetic ES cell lines at passages 
6-7 by either metaphase spread analysis or sub-2c-cell sorting (see below and 
Extended Data Tables 1 and 2), haploid ES cell lines were established by sorting 
the 1c-cell population, with diploid cells serving as a reference. Haploid ES cell 
cultures were further maintained by enrichment rounds of 1c-cell sorting every 
3-4 passages. 

Metaphase spread analysis. For induction of mitotic arrest, growing cells 
were incubated for 40 min in the presence of 100 ng ml"! colcemid (Biological 
Industries), added directly to the culture medium in a humidified incubator at 
37°C with 5% CO>. The cells were then trypsinized, centrifuged at 1,000 r.p.m. at 
room temperature and gently resuspended in 37°C warmed hypotonic solution 
(2.8mg ml? KCland 2.5 mg mI! sodium citrate) followed by 20 min of incubation 
at 37°C. Cells were fixed by addition of fixative solution (3:1 methanol:acetic acid) 
and incubation for 5 min at room temperature. Fixation was repeated at least three 
times following centrifugation and resuspension in fixative solution. Metaphase 
spreads were prepared on slides and stained using the standard G-banding tech- 
nique. Karyotype integrity was determined according to the International System 
for Human Cytogenetic Nomenclature (ISCN) based on the observation of a nor- 
mal karyotype in at least 80% of analysed metaphases (minimum of 20 metaphases 
per analysis). 

Live ES cell sorting by DNA content. Cells were washed with phosphate buffered 
saline (PBS), dissociated using either TrypLE Select or TrypLE Express (Thermo 
Fisher Scientific) and stained with 10,1gml~! Hoechst 33342 (ref. 2) (Sigma- 
Aldrich) in human ES cell medium at 37°C for 30 min. Following centrifugation, 
cells were resuspended in PBS containing 15% KSR and 10\1M ROCK inhibi- 
tor Y-27632, filtered through a 70-|.m cell strainer (Corning) and sorted using 
the 405 nm laser in either BD FACSAria III or BD Influx (BD Biosciences). For 
continued growth, sorted cells were plated with fresh medium containing 101M 
ROCK inhibitor Y-27632 for 24h. For comparative analyses, G1-phase cells were 
sorted from isogenic haploid-enriched and unsorted diploid cultures. Cells that 
had undergone diploidization relatively recently in culture (within 3 passages after 
haploid cell enrichment) were isolated by sorting the 4c peak in haploid-enriched 
cultures and compared with 4c diploid cells from unsorted diploid cultures. Note 
that haploid-enriched cultures also consist of a mixed 2c-cell population of G2/M- 
phase haploids and G1-phase diploids. Sorting purity was confirmed by rerunning 
a fraction of sorted samples through the instrument. 

Flow cytometry. All DNA content profiles were generated based on flow cytom- 
etry with Hoechst 33342 staining. Haploid cell proportion was estimated based 


on the percentage of 1c cells and the relative contribution of G1 cells with regards 
to other phases of the cell cycle. Estimation of diploidization rate was based on 
the proportion of haploid cells between consecutive enrichment rounds as well as 
experimental analysis of h-pES10 diploidization kinetics throughout 7 passages 
(30 days) by analysing the DNA content of 2-3 replicates at each passage using 
flow cytometry with propidium iodide in methanol-fixed and RNase-treated 
cells. Diploidization rate was estimated by fitting the data to an exponential decay 
curve. For simultaneous flow cytometry analysis of DNA content and cell surface 
molecules, cells were washed, dissociated and incubated on ice for 30 min in the 
presence of 10j1gml~! Hoechst 33342 (Sigma-Aldrich) and either a conjugated 
antibody or a secondary antibody diluted 1:200 following a 60 min incubation with 
a primary antibody. For simultaneous flow cytometry analysis of DNA content and 
intracellular PDX1, dissociated cells were treated as described for immunofluo- 
rescence procedures, with Hoechst 33342 for DNA staining. Primary antibodies 
are detailed in Supplementary Table 1. In all flow cytometry procedures, samples 
were filtered through a 70-|um cell strainer (Corning Life Sciences) and analysed 
with either BD FACSAria III or BD Influx (BD Biosciences). 

DNA fluorescence in situ hybridization. DNA FISH was performed as described 
elsewhere”? using probes for human chromosomes 2 and 4 and DNA staining with 
4',6-diamidino-2-phenylindole (DAPI). Haploidy and diploidy were respectively 
determined per nucleus based on single or double hybridization signals. ES cells 
subject to FISH were grown on Matrigel-coated plates in StemFitN.AK03 medium 
for several passages before analysis. 

Alkaline phosphatase and immunofluorescence staining. Alkaline phosphatase 
staining was performed using the Leukocyte Alkaline Phosphatase Kit (Sigma- 
Aldrich). For immunofluorescence staining, samples were washed with PBS, 
fixed with 4% paraformaldehyde for 10 min, and permeabilized and blocked 
in blocking solution (0.1% Triton X-100 and 5% donkey serum in PBS). Cells 
were incubated with primary antibodies (detailed in Supplementary Table 1) 
and secondary antibodies diluted 1:500 in blocking solution, and DAPI was 
used for DNA staining. Cells were washed twice with PBS subsequently to fix- 
ation and each incubation step. Images were taken using Zeiss LSM 510 Meta 
Confocal Microscope. Centromere quantification was carried out by manually 
counting centromere foci across individual planes along the z axis. EdU staining 
was performed using the Click-iT EdU Alexa Fluor 488 Imaging Kit (Thermo 
Fisher Scientific). ES cells subject to centromere staining in Fig. le and Extended 
Data Fig. le were grown on Matrigel-coated plates in StemFitN.AK03 for several 
passages before analysis. 

6-TG resistance screen. To generate a gene trap mutant library, 9 replicates of 
approximately 4 x 10° haploid pES10 cells (within one passage after 1c-cell enrich- 
ment) were co-transfected with 201g 5’-PTK-3’ gene trap vector*® and 201g 
pCyL43 piggyBac transposase plasmid?’ using Bio-Rad Gene Pulser (suspended 
in 800,1L Opti-MEM, 4-mm cuvettes, 320 V, 2501F), and replated on a 100 x 20 
mm dish with DR3 MEFs and ROCK inhibitor Y-27632. Selection for insertions 
into expressed loci was carried out using 0.3 1.gml~! puromycin starting 48 h post 
transfection, followed by pooling into a single library, represented by approxi- 
mately 16,000 resistant colonies. Transfection with 5’-PTK-3’ only was used as 
a negative control. To screen for 6-TG-resistant mutants, the mutant library was 
grown in the presence of 6\1M 6-TG (Sigma-Aldrich) on DR4 MEFs for 18 days, 
during which 6 resistant colonies were independently isolated and characterized. 
Analysis of a resistant clone showed persistence of haploid cells. Genomic DNA 
was extracted (NucleoSpin Tissue Kit, MACHEREY-NAGEL) and insertion sites 
were detected using splinkerette PCR as described previously”, followed by PCR 
product purification and Sanger sequencing (ABI PRISM 3730xl DNA Analyzer 
(Applied Biosystems)). Sequences were mapped to the human genome (GRCh38/ 
hg38) using UCSC BLAT search tool. 

Isolation of total DNA and RNA. Total DNA was isolated using the NucleoSpin 
Tissue Kit (MACHEREY-NAGEL). Total RNA was isolated using Qiagen RNeasy 
Kits according to the manufacturer’s protocols. To determine total RNA levels per 
cell, haploid and diploid cells were isolated from the same cultures by sorting the 
1c (haploid G1) and 4c (diploid G2/M) populations, respectively. Following growth 
for 2 passages, cells were harvested and counted, and RNA was isolated from tripli- 
cates of 400,000 cells from each cell line and ploidy state (pES10 and pES12, haploid 
and diploid; 12 samples in total). RNA amounts were quantified using NanoDrop. 
Genome integrity analysis. Copy number variation (CNV) analysis was carried 
out on DNA samples of G1-sorted haploid and diploid pES10 and pES12 cells 
(see Supplementary Table 2) using Infinitum Omni2.5Exome-8 BeadChip single 
nucleotide polymorphism (SNP) arrays (Illumina) following the manufacturer's 
protocols. Raw data were processed using Genome Studio Genotyping Module 
(Illumina) to obtain log R ratios values for analysis using R statistical programming 
language. As expected, diploid pES10 and pES12 cells were homozygous across 
all chromosomes. 
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RNA sequencing. For a detailed list of samples analysed by RNA-seq, see 
Supplementary Table 3. Total RNA samples (200 ng-1 1g, RNA integrity number 
(RIN) >9) were enriched for mRNAs by pulldown of poly(A)* RNA. RNA-seq 
libraries were prepared using the TruSeq RNA Library Prep Kit v2 (Illumina) 
according to the manufacturer's protocol and sequenced using Illumina NextSeq 
500 to generate 85 bp single-end reads. 

Transcriptome analysis. RNA-seq reads were aligned to the human reference 
genome (GRCh37/hg19) using TopHat (version 2.0.8b) allowing 5 mismatches. 
Reads per kilobase per million fragments mapped (RPKM) values were quan- 
tified using Cuffquant and normalized using Cuffnorm in Cufflinks (version 
2.1.1) to generate relative gene expression levels. Hierarchical clustering anal- 
yses were performed on RPKM values using Pearson correlation and average 
linkage. Analysis of differential gene expression relative to total RNA in haploid 
and diploid human ES cells (n =4 in each group) was carried out by two com- 
plementary strategies, as follows: first, we used Cuffdiff with default parameters, 
considering differences of greater than twofold with FDR <0.05 as significant; 
second, to identify possibly subtle yet consistent transcriptional differences, we 
tested for genes whose minimal expression levels across all replicates of a certain 
group were higher than their maximal expression level across all replicates of the 
other group. Statistical significance was then determined by two-tailed unpaired 
Student's t-test. Functional annotation enrichment analysis was done by DAVID 
(using the Benjamini method to determine statistical significance). Imprinting 
analyses included 75 human imprinted genes (http://www.geneimprint.com/), 
listed in Supplementary Table 4. RNA-seq data from control ES cell line NYSCF1 
were published elsewhere”? (GEO accession number GSE61657). Genome-wide 
gene expression moving median plots were generated using the R package zoo 
(version 1.7-12) after removal of genes that were not expressed in the averaged 
reference diploid sample by flooring to 1 and setting an expression threshold of 
above 1. RNA-seq data from different tissues were retrieved from the Genotype- 
Tissue Expression (GTEx) portal (http://www.gtexportal.org/)°°. Colour-coded 
scales in Fig. 4d correspond to gene expression levels relative to the mean across 
tissues (left scale) and across each set of ES cell duplicate and EB sample (right 
scale). Expression microarray analysis was performed as previously*! by using 
Affymetrix Human Gene 1.0 ST arrays. 

DNA methylation analysis. DNA methylation analysis was performed on 
genomic DNA from the samples detailed in Supplementary Table 2 using 
Infinium HumanMethylation450 BeachChips (Illumina) following the Infinitum 
HD Methylation Protocol as described previously”?. DNA methylation data from 
control ES cell line NYSCF1 were published before”® (GEO accession number 
GSE61657). Data were processed and normalized by using subset-quantile within 
array normalization (SWAN) and adjusted for batch effects using the R package 
ChAMP (version 1.4.0). DNA methylation levels at CpG sites associated with 
pluripotency-specific genes and iDMRs were analysed as described before”’. 
For analysis of DNA methylation levels on the X chromosome, probes with 
average ( values of less than 0.4 were filtered out. DMR analysis was facili- 
tated by the lasso function in ChAMP using default settings. DMRs were then 
assigned to genes by proximity and analysed for functional annotation enrich- 
ment using DAVID (using the Benjamini method to determine statistical 
significance). 

Cell size analysis. Following sorting of haploid and diploid cell populations in G1, 
the diameter (2r) of viable single cells was measured by Countess Automated Cell 
Counter (Invitrogen) and their surface area and volume were calculated as 47° 
and 4/3zr°, respectively. Analysis included 7, 4, 8 and 4 technical replicates for In 
pES10, 1n pES12, 2n pES10 and 2n pES12, respectively. 

Mitochondrial DNA abundance analysis. Relative mtDNA abundance was 
analysed by quantitative PCR (qPCR) by using primers for the mitochondrial 
gene MT-ND2 (forward primer: 5'-TGTTGGT TATACCCTTCCCGTACTA-3’; 
reverse primer: 5/-CCTGCAAAGATGGTAGAGTAGATGA-3’) and normal- 
ization to nuclear DNA by using primers for the nuclear gene BECN1 
(forward primer: 5’/-CCCTCATCACAGGGCTCTCTCCA-3’; reverse primer: 
5'-GGGACTGTAGGCTGGGAACTATGC-3’), as described elsewhere*”. 
Analysis was performed using Applied Biosystems 7300 Real-Time PCR System 
with PerfeCTa SYBR Green FastMix (Quanta Biosciences). Analysis included all 
G1-sorted samples detailed in Supplementary Table 2 (n= 4 for each group, with 
two biological replicates for each cell line). 

Embryoid body differentiation. EB differentiation was carried out by detaching 
ES cell colonies with Trypsin solution A without EDTA (Biological Industries), 
followed by resuspension and further culture of cell aggregates in human ES cell 
medium without bFGF on low attachment plates. Differentiation of haploid ES cells 
was initiated within 2 passages after 1c-cell enrichment. After 21 days, EB RNA 
was extracted from unsorted and/or sorted EB cells in G1 following dissociation 
and staining with 10j.gml~! Hoechst 33342 (Sigma-Aldrich) at 37°C for 30 min. 
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Metaphase spread analysis was performed on dissociated EB cells plated on 0.2% 
gelatin and expanded in human ES cell medium without bFGE 

Differentiation into neural progenitor cells. NCAM1-positive ES cell-derived 
neural progenitor cells were obtained using a 10-days protocol for efficient neural 
differentiation® with slight modification**. Differentiation was initiated within 
2 passages after 1c-cell enrichment. RNA was extracted from sorted haploid 
NCAM1-positive cells in G1 by co-staining with Hoechst 33342 and an anti- 
human NCAM-1/CD56 primary antibody (see Supplementary Table 1) and a Cy3- 
conjugated secondary antibody (Jackson Immunoresearch Laboratories) diluted 
1:200. 

Neuronal differentiation. Differentiation into neurons was carried out by follow- 
ing a published protocol*° based on synergistic inhibition of SMAD signalling** 
with modification, as follows: differentiation was initiated within 2 passages after 
1c-cell enrichment with fully confluent ES cells cultured on Matrigel-coated plates 
in mTeSR1 by replacing the medium with human ES cell medium without bFGF, 
containing 10\1M SB431542 (Selleckchem) and 2.5 1M LDN-193189 (Stemgent) 
for 4 days. Subsequently, cells were kept in N2 medium* supplemented with 10\.M 
SB431542 and 2.5|1M LDN-193189 for an additional 4 days, followed by 2 days in 
N2 medium supplemented with B-27 (Thermo Fisher Scientific) and 101M DAPT 
(Stemgent). The cells were then dissociated and replated on 0.01% poly-L-ornithine 
coated (Sigma-Aldrich) and laminin coated (41g ml !, Thermo Fisher Scientific) 
plates in the presence of 101M ROCK inhibitor Y-27632 (Selleckchem), and fur- 
ther cultured in the same medium without Y-27632 for the next 4 days. Neuronal 
cultures were maintained in N2 medium supplemented with B-27 and 20ng mI"! 
BDNF (R&D) until analysis by immunostaining and FISH on day 20. 
Cardiomyocyte differentiation. 80-90% confluent ES cells grown on Matrigel- 
coated plates in mTeSR1 were subject to an 11-day regimen” based on consecutive 
GSK3 and WNT inhibition with CHIR99021 and IWP-2 (Selleckchem), respec- 
tively. Differentiation was initiated within 2 passages after 1c-cell enrichment. On 
day 11 of differentiation, 1c cells were sorted and plated for immunostaining. 
Differentiation towards the pancreatic lineage. The protocol used here was devel- 
oped based on several recent publications**“°. ES cells grown in feeder-free con- 
ditions were differentiated into definitive endoderm by using STEMdiff Definitive 
Endoderm Kit (StemCell Technologies) for 3-4 days. Subsequent specification was 
achieved by a step-wise protocol involving treatment with recombinant human 
KGF/FGF7 (R&D Systems), LDN-193189 (Stemgent), KAAD-cyclopamine 
(Stemgent) and retinoic acid (Stemgent). On days 8-11, EGF (R&D System) 
was used to induce pancreatic cells. Differentiation was initiated within as few as 
2 passages after 1c-cell enrichment. 

Teratoma formation assay. All experimental procedures in animals were approved 
by the ethics committee of the Hebrew University. ES cells were trypsinized 
and approximately 2 x 10° cells were resuspended in 100] human ES cell 
medium and 100,11 Matrigel (BD Biosciences), followed by subcutaneous 
injection into NOD-SCID Tl2arg’ ~ immunodeficient mice (Jackson Laboratory). 
8-12 weeks after injection tumours were dissected and subjected to further 
analysis. Histological slides were prepared from tumour slices cryopreserved 
in O.C.T. compound (Sakura Finetek) using Leica CM1850 cryostat (Leica 
Biosystems, 10-\1m sections), followed by immunostaining, haematoxylin and 
eosin staining or FISH analysis. Flow cytometry with Hoechst 33342 staining was 
performed on dissociated cells from freshly dissected tumours. 
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Extended Data Figure 1 | Derivation of haploid human ES cell lines 
h-pES10 and h-pES12. a, Left panel, diploidization rate model for a 
haploid egg with a theoretical diploidization probability of 10%, overlaid 
with an exponential decay fit (red curve, see Supplementary Notes). 
Approximated cell cycle numbers for different ES cell line derivation 
stages are indicated. Right panel, diploidization dynamics of h-pES10 
over seven passages by flow cytometry, overlaid with an exponential 
fit to the data (red curve). Bars indicate the mean across biological 
replicates. b, Establishment of a haploid-enriched human ES cell line 
from pES12 cells after repeated sorting and enrichment of Ic cells 
using Hoechst 33342 staining. Top to bottom, DNA content profiles of 


unsorted diploid cells, partially purified haploid cells at the third sort, 
and mostly-purified haploid cells at the fifth sort. c, chromosomal copies. 
c, Karyotypes and haploid metaphase percentage over the course of 
enrichment and passaging. d, e, DNA FISH (d) and centromere protein 
immunofluorescence staining (e) in haploid-enriched pES12 cells (upper 
panels) and unsorted diploid pES10 cells (lower panels). Magnified insets 
show representative haploid and diploid nuclei with single or double 
hybridization signals (d) and 23 or 46 centromeres (e), respectively. Scale 
bars, 10 jm. f, SNP array-based CNV analysis comparing haploid pES10 
and pES12 cells with their unsorted diploid counterparts (logarithmic 
scale). 
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Extended Data Figure 2 | Determination of ploidy at single-cell level by 
quantification of centromere foci. a, The counted number of centromeres 
correlates with ploidy. 1n, haploid-enriched pES10 cells grown for 

4 passages after the fourth sort (n = 33; 76% haploids by this assay); 

2n, unsorted diploid pES10 cells (1 = 34); 3n, soPS2 cells”? (n = 27); 

4n, Hybrid] cells! (n= 27). Black horizontal lines indicate mean + s.e.m. and 
dashed lines mark expected chromosome numbers. b, Quantification of 
haploid and diploid cells by DNA FISH in the haploid-enriched (n= 152; 
73% haploids by this assay) and diploid (n = 135) cells in a. c, DNA 


content profile of the haploid-enriched cells in a (73% haploids by this 
assay). c, chromosomal copies. d, e, Co-staining of centromeres and either 
phospho-histone 3 (pH3, Ser10) (d) or 5-ethynyl-2’-deoxyuridine (EdU) 
(e) for distinguishing between different stages of interphase in haploid 
pES12 cells. In blue, DNA staining. Scale bar, 51m. f, Quantification of 
centromere counts in the different cell cycle stages shown in d and e. 

n indicated in parenthesis. Black horizontal lines indicate mean + s.e.m. 
See Supplementary Notes for details. 
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Extended Data Figure 3 | Pluripotent stem cell markers in haploid in men and h-pES12 at colony resolution (upper panels; scale bars, 
human ES cells. Co-staining of pluripotency markers NANOG, OCT4, 501m) and single-cell resolution (lower panels; scale bars, 10 |1m). 
SOX2, SSEA4 and TRA-1-60 (red), centromeres (green) and DNA (blue) Magnified insets show representative haploid cells with 23 centromeres. 


h-pES10 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


a c 
Gene expression (RPKM) 5 
25 g 1.0 
2 
15 & 
CPT TITITT) i 
2 05 
SSLEeMVETSENMNTE & 
wieeseesesnaungon g 
SS w 
rr2008888888 = 9 
a oa &. 82 2 4 & & oc 
ie a ae Oe, ae ee ee 
-~-rWirr nna 
IVF ES pES 
Chromosome: 
b d 
DNA methylation (8 values) @ 10 
3 gs . 
= 09 
2 & 08 
1 q 07 
sir a. Td & 06 
SS SC. «> ES en ER a ee ene 5 0.5 
oon cCcYWeowoeoekeowt = 
BahPPS=ce2soconnaa @ 04 
rrrrrr,re,r > 
PETNH HH HHH NH £ 03 
Zuuwuuwuuouww ww ® 02 
FE&&a&GEE < 04 
Zz 04 
fa) 
IVF ES pES 
Chromosome: 
e 


Gene trap vector 


5'ITR SAIRES PuroAtk pA 


m2nIVFES 
B inpES 
B@ 2n pES 


nN BeAEEBE B 
iva) ® a YW Ff 5S & SS A 
as eEgeR rE E 
ta Ss Ss 17) n S 
2 6 10 15 19 20 
Paternally- Maternally- 
methylated methylated 
f 2 8 SS EVYER SE 
go = L 6 S Wer Skt 
Q S < =a 2228 
=} te S Ss yn ~Y 
2 10 14 4 6 11 15 
3'ITR 


—_» En ><a 


FRT 


Trapped genomic locus 


FRT 


| Transposition 


>) ><a 


Truncated transcript 


Transcription 
and processing 


= 0. 


Extended Data Figure 4 | Analysis of parental imprinting and gene trap 
mutagenesis in haploid human parthenogenetic ES cells. 

a, b, Clustering analysis of diploid in vitro fertilization (IVF) ES cells 

and G1-sorted haploid and diploid parthenogenetic ES (pES) cells by 
expression levels of imprinted genes (n =75, see Supplementary Table 4) 
(a) and DNA methylation levels at imprinted differentially methylated 
regions (iDMRs, n= 35)? (b). The symbols (1) and (2) indicate biological 
replicates. c, Relative mean expression levels + s.e.m. of representative 
paternally expressed imprinted genes across seven chromosomes in 

the samples shown in a (RPKM ratios). d, Mean DNA methylation 

levels + s.e.m. at representative paternally methylated and maternally 
methylated iDMRs (typically intermediately methylated in bi-parental 


control cells, and respectively hypomethylated and hypermethylated in 
parthenogenetic cells) in the samples shown in b. 6 values range from 
complete hypomethylation (0) to complete hypermethylation (1). 

e, Schematic outline of the piggyBac gene trap system. The gene trap 
vector”® is flanked by piggyBac inverted terminal repeats (ITRs) and FRT 
flox sites, and carries a 5’ splice acceptor (SA), an internal ribosome entry 
site (IRES) element followed by a promoterless puromycin resistance gene 
(Puro Atk) and a 3’ poly(A) signal (pA). In the presence of the PiggyBac 
transposase (encoded on a separate plasmid”’, not shown), the gene trap 
vector undergoes random transposition into the genome. Insertion into 

a transcriptionally active gene results in truncation of the endogenous 
transcript and introduction of resistance to puromycin. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Comparative analyses of isogenic haploid and 
diploid human ES cells. a, Sorting purity of haploid and diploid ES cells 
in G1. b, Log-scaled volcano plots of relative differential gene expression 
between haploid and diploid human ES cells, divided into panels by 

all genes (top), autosomal genes (middle) and X chromosomal genes 
(bottom). Q, false discovery rate (FDR). Significantly downregulated and 
upregulated genes (greater than twofold change, Q < 0.05) in haploid cells 
are marked in red and blue, respectively, and their totals are indicated to 
the right. Note that XIST is the most downregulated transcript in haploid 
cells. c, Smoothed distributions of the 1n/2n gene expression ratios for 

all expressed genes, all expressed autosomal genes and all expressed 

X chromosomal genes (expression threshold, mean RPKM >0.1). 

d, Genome-wide moving median plot of the gene expression ratio between 
haploid and diploid pES10 cells in G1 by expression microarray analysis 
(window size = 100 genes). e, f, Model for genome-wide autosomal 


gene expression level reduction in haploid human ES cell as inferred by 
differential X chromosome inactivation status. e, DNA content, RNA 
expression levels relative to total RNA and presumed equality of absolute 
X chromosomal gene dosage in haploid (X,) and diploid (X,X;) human ES 
cells, enable the estimation of total RNA levels per haploid cell. X, and X; 
denote active (blue) and inactive (red) X chromosomes, respectively. A, 
autosomes; X, X chromosome; R, total RNA. f, Schematic genome-wide 
representation of relative and absolute RNA levels in the cells shown in e. 
g, Average diameter and calculated surface area and volume of G1-sorted 
haploid and diploid ES cells. Error bars represent s.d. *P < 0.01 (two-tailed 
unpaired Student's t-test). h, i, Functional annotation enrichment analysis 
for relatively downregulated genes and differentially methylated regions 
(DMRs) (h), as well as relatively upregulated genes (i) in haploid ES cells 
compared with diploid ES cells. 
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Extended Data Figure 6 | EB differentiation of haploid human ES cells. diploid pES12 cells. c: chromosomal copies. ¢, Expression levels (RPKM) 
a, Representative image of plated cells dissociated from h-pES12-derived of tissue- and pluripotency-specific genes in undifferentiated (ES) and 
21-day EBs. Karyotype is shown in Fig. 4b. Scale bar, 100j1m. b, DNA differentiated (EB) G1-sorted haploid (1n) pES10 cells. 

content profiles of dissociated EBs derived from haploid-enriched and 
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Extended Data Figure 7 | Directed differentiation of haploid human 
ES cells. a, b, Flow cytometry analysis with co-staining of DNA and 
NCAM1 in h-pES10 cells following neural differentiation. a, Gating for 
NCAM1-positive cells (right) based on a negative secondary antibody 
stained control sample (left). b, DNA content profiles of the entire cell 
population (left) and NCAM1-positive cells (right). c, chromosomal 
copies. c, d, Expression levels of neural- and pluripotency-specific 
genes in haploid G1-sorted ES cells and NPCs. e, XIST expression levels 
in haploid and diploid pES10-derived EBs and NPCs. f, TUJ1 staining 
in h-pES12-derived neurons. Scale bar, 100 1m. g, DNA FISH in the 


Hoechst 33342-A 
(DNA content units) 


neurons shown in f. Magnified insets show representative haploid and 
diploid nuclei with single and double hybridization signals, respectively. 
Scale bar, 101m. h, TNNT2 staining in Gl-sorted haploid pES12-derived 
cardiomyocytes. Scale bar, 101m. i, j, Flow cytometry analysis with 
co-staining of DNA and PDX] in h-pES10 cells following pancreatic 
differentiation. i, Gating for PDX1-positive cells (right) based on a 
negative secondary antibody stained control sample (left). j, DNA content 
profile of the entire cell population (related to Fig. 4k). c, chromosomal 
copies. 
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Extended Data Figure 8 | In vivo differentiation of haploid human ES (mesoderm), AFP (endoderm) and OCT4 (pluripotency) staining in 
cells. a, Haematoxylin and eosin histological sections of teratomas derived an h-pES10-derived teratoma. DNA staining is shown in blue. Note the 
from h-pES10 and h-pES12. Scale bar, 501m. b, TUJ1 (ectoderm), a-SMA absence of nuclear OCT4 staining. Scale bars, 100 1m. 
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Extended Data Table 1 | Identification of haploid cells in early-passage human parthenogenetic ES cell lines by metaphase spread analysis 


5 Passage number nae bad Bomber eo Total number % Haploid 
pES cell line Oocyte donor at analysis haploid diploid of metaphases metaphases 
metaphases metaphases 
1 8/31/09 4 0 233 233 0 
2 1043 6 0 278 278 0 
3 1058 6 0 273 273 0 
4 1058 6 0 222 222 0 
5 1058 9 0 50 50 0 
e aga oF ; 140 140 ; 
8 1151 4 0 361 361 0 
9 1157 6 0 234 234 0 
10 1160 7 2 150 152 1.32 
11 1160 8 ie) 194 194 0 
Total: 2 2148 2150 


The derivation of cell lines pES1-6 was reported previously!! 23. 
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Extended Data Table 2 | Isolation of haploid cells from early-passage human parthenogenetic ES cell lines by sub-2c-cell sorting 


Passage number at second sort Presence of haploid cells at second sort 


pES cell line Oocyte donor Passage number at first sort 
6 1105 6 10 Undetected 
12 1160 6 10 Yes 
swaPS cell line* 
4 1126 6 ub) Undetected 
5 1155 5 9 Undetected 
11 1175 4 Sorted population did not survive 


*swaPS cells are parthenogenetic ES cells derived following activation of an oocyte whose nuclear genome had been swapped with that of another oocyte! 
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Mitochondrial ROS regulate thermogenic energy 
expenditure and sulfenylation of UCP1 


Edward T. Chouchani'*, Lawrence Kazak!**, Mark P. Jedrychowski’, Gina Z. Lu!?, Brian K. Erickson’, John Szpyt?, | 
Kerry A. Pierce*, Dina Laznik-Bogoslavski!, Ramalingam Vetrivelan’, Clary B. Clish?, Alan J. Robinson®, Steve P. Gygi? & 


Bruce M. Spiegelman!? 


Brown and beige adipose tissues can dissipate chemical energy as 
heat through thermogenic respiration, which requires uncoupling 
protein 1 (UCP1)!. Thermogenesis from these adipocytes can 
combat obesity and diabetes*, encouraging investigation of factors 
that control UCP 1-dependent respiration in vivo. Here we show 
that acutely activated thermogenesis in brown adipose tissue is 
defined by a substantial increase in levels of mitochondrial reactive 
oxygen species (ROS). Remarkably, this process supports in vivo 
thermogenesis, as pharmacological depletion of mitochondrial 
ROS results in hypothermia upon cold exposure, and inhibits 
UCP 1-dependent increases in whole-body energy expenditure. 
We further establish that thermogenic ROS alter the redox status 
of cysteine thiols in brown adipose tissue to drive increased 
respiration, and that Cys253 of UCP1 is a key target. UCP1 
Cys253 is sulfenylated during thermogenesis, while mutation of 
this site desensitizes the purine-nucleotide-inhibited state of the 
carrier to adrenergic activation and uncoupling. These studies 
identify mitochondrial ROS induction in brown adipose tissue as 
a mechanism that supports UCP1-dependent thermogenesis and 
whole-body energy expenditure, which opens the way to improved 
therapeutic strategies for combating metabolic disorders. 

The specialized capacity for thermogenic respiration in brown 
adipose tissue (BAT) and beige fat* relies on UCP1. This mitochon- 
drial inner membrane carrier dissipates protonmotive force (Ap) 
and increases the rate of substrate oxidation to generate heat’. While 
UCP1 is critical for thermogenesis, mechanisms that support UCP1- 
dependent respiration in vivo are not fully understood!”. 

We examined features of mitochondrial function in mouse inter- 
scapular BAT after cold (4°C) exposure. Unexpectedly, we found that 
acute activation of BAT thermogenesis in vivo was associated with 
a substantial increase in mitochondrial ROS levels. Mitochondrial 
superoxide (Fig. 1a), lipid hydroperoxides (Fig. 1b), and mitochondrial 
hydrogen peroxide (Fig. 1c)° levels increased significantly in BAT after 
acute cold exposure. Furthermore, primary brown adipocytes exposed 
to adrenergic stimulation of thermogenesis by noradrenaline exhibited 
acute dose-dependent increases in superoxide levels (Extended Data 
Fig. la). Having identified the induction of mitochondrial ROS as an 
early event during thermogenesis in BAT, we investigated whether it 
was a regulator of function in vivo. We injected mice intraperitone- 
ally (i.p.) with the mitochondria-targeted antioxidant MitoQ before 
cold exposure. MitoQ efficiently depletes mitochondrial free radical 
species in vivo and achieves substantial localization in metaboli- 
cally active tissues including adipose®. Treatment with MitoQ before 
cold exposure inhibited the cold-dependent increases in BAT mito- 
chondrial superoxide and lipid hydroperoxides (Fig. 1a, b). Notably, 
MitoQ administration resulted in a dose-dependent induction of 
hypothermia upon cold exposure (Fig. 1d). MitoQ had no effect on 


body temperature of mice without cold stress (Fig. 1d), and equimolar 
injections of decylT PP, the mitochondria-targeting moiety lacking 
the antioxidant group, had no significant effect on body temperature 
during cold exposure (Extended Data Fig. 1b). 

While BAT and beige fat are the primary tissues for mediating adap- 
tive thermogenesis, muscle shivering also plays an important role, 
particularly during the early thermogenic response’. Shivering was 
monitored directly by electromyography (EMG) after the transition 
from thermoneutrality to cold. As expected, by 30 min cold exposure 
substantial shivering was observed (Extended Data Fig. 1c—e), which 
could be abrogated by the nicotinic acetylcholine receptor inhibitor 
curare (Extended Data Fig. 1c). Importantly, treatment of mice with 
MitoQ before cold exposure caused no detectable change in shivering 
activity (Extended Data Fig. 1c-e). 

We next tested whether the hypothermic effects of depleting mito- 
chondrial ROS depended on UCPI1. Mice genetically lacking UCP1 
(UCP17‘~) are sensitive to acute cold challenge, but maintain thermal 
homeostasis after gradual acclimation to temperatures below ther- 
moneutrality through compensatory upregulation of alternative ther- 
mogenic pathways®. We injected wild-type (WT) and UCP1~/~ mice 
with MitoQ after gradual cold acclimation. Notably, UCP1~/~ mice 
were protected against the hypothermic effects of MitoQ (Fig. le). 
Moreover, mice injected with MitoQ 10 min before 83-adrenergic 
stimulation by CL 316,243 (CL)? exhibited reduced CL-dependent 
oxygen consumption (Fig. 1f, g and Extended Data Fig. 1f). These 
differences were observed in the absence of any significant changes 
in physical movement (Extended Data Fig. 1g). Together, these data 
provide evidence that mitochondrial ROS support UCP 1-dependent 
thermogenesis in vivo. 

It is increasingly appreciated that mitochondrial ROS act as sig- 
nalling molecules in physiology, which involves oxidative modi- 
fication of sensitive cysteine thiol residues on proximal protein 
targets!°. Strikingly, acute cold exposure drove substantial oxidation 
and depletion of the BAT glutathione pool (Extended Data Fig. 2a). 
Furthermore, by performing quantitative profiling of in vivo protein 
thiol redox status in BAT using ratiometric labelling with iodoTMT 
(Extended Data Fig. 2b), we found that BAT protein thiols became 
substantially more oxidized upon cold exposure (Fig. 2a and Extended 
Data Fig. 2c). Pathway analysis of those protein cysteine targets most 
substantially oxidized during thermogenesis revealed pathways 
involved in mitochondrial metabolism to be the most enriched (Fig. 2b 
and Supplementary Table 1). Finally, we profiled levels of protein 
sulfenic acids''—the proximal reversible cysteine modification gen- 
erated by lipid hydroperoxides and hydrogen peroxide!*—and found 
them to be significantly elevated upon cold exposure (Fig. 2c). These 
findings suggested that oxidation of mitochondrial thiols might be 
important for signalling ROS-dependent thermogenesis in BAT. 


1Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA. @Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA. 3Broad 
Institute of Harvard and MIT, Cambridge, Massachusetts 02142, USA. “Department of Neurology, Harvard Medical School, Boston, Massachusetts 02215, USA. 5MRC Mitochondrial Biology Unit, 


Hills Road, Cambridge CB2 OXY, UK. 
*These authors contributed equally to this work. 
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Figure 1 | Increased BAT mitochondrial ROS levels support UCP1- 
dependent thermogenesis in vivo. a—c, Effect of acute cold exposure 
on in vivo BAT (a) mitochondrial superoxide-dependent inactivation 
of mitochondrial aconitase (n= 5; 5mgkg~! MitoQ n=4), (b) lipid 
hydroperoxide content (m= 5), and (c) mitochondrial hydrogen-peroxide- 
dependent oxidation of Prx3. Oxidized Prx3 was assessed using the 
redox gel shift method described in Methods. MM, molecular mass. 
Cys. ox., cysteine oxidation. For uncropped scans see Supplementary 
Figs 1-3. d, Effect of i.p. MitoQ on core body temperature after acute 


We examined this possibility using N-acetylcysteine (NAC), a 
cell-permeable cysteine precursor shown to increase the intracel- 
lular pool of reduced thiols'*. Similar to what was observed with 
MitoQ, NAC treatment resulted in a significant decrease in core 
body temperature upon cold exposure (Fig. 2d). This hypother- 
mic activity suggests an effect on BAT function, as shivering was 
unaffected (Extended Data Fig. 1c-e). Furthermore, acute elevation 
in oxygen consumption by CL administration was inhibited by NAC 
pretreatment (Fig. 2e and Extended Data Fig. 2d). Taken together, 
these findings suggest that increased mitochondrial ROS levels and 
consequent thiol oxidation play important roles in driving BAT 
thermogenesis in vivo. 

The acute hypothermic effects of depleting mitochondrial ROS and 
altering thiol redox status occurred without any appreciable decrease 
in thermogenic gene expression in either BAT or inguinal white adi- 
pose tissue (Extended Data Fig. 3a, b). In fact, some thermogenic gene 
markers were further elevated following MitoQ or NAC treatment, 
possibly representing a compensatory increase in expression due to 
functional compromise of thermogenic respiration (Extended Data 
Fig. 3a, b). Furthermore, stimulation of the lipolytic machinery in 
BAT was comparable upon cold challenge after MitoQ treatment, sug- 
gesting this upstream driver of thermogenesis was not compromised 
(Extended Data Fig. 3c, d). To test whether mitochondrial ROS might 
act as direct modulators of UCP1-dependent leak respiration, we 
examined primary brown adipocytes using cellular respirometry'*”>. 
As expected, adrenergic simulation by noradrenaline triggered 
a substantial increase in oxygen consumption rate (OCR; Fig. 3a 
and Extended Data Fig. 3e); while inhibition of ATP synthase with 
oligomycin allowed for determination of leak respiration'*!> (Fig. 3a 
and Extended Data Fig. 3e). Depletion of mitochondrial ROS with 
MitoQ and reducing the thiol redox status with NAC both inhibited 
leak respiration (Fig. 3a, b and Extended Data Fig. 3e, f). Notably, these 


BB 5 mg kg" Mitoa 


BB 5 mg kg" Mitoa 


cold exposure (n= 10). e, Effect of ip. MitoQ on core body temperature of 
WT and UCP1~/~ mice after 4°C acclimation (n = 10; UCP1~/~ n=8). 

f, Oxygen consumed acutely before and after ip. CL + MitoQ (n=5). 

g, Oxygen consumed 3h before and after ip. CL + MitoQ (n=8). Data are 
mean +s.e.m. of at least four mouse replicates. *P < 0.05, ***P < 0.001 
(two-tailed Student's t-test for pairwise comparisons, one-way/two-way 
analysis of variance (ANOVA) for multiple comparisons involving one/two 
independent variables). 


inhibitory effects were lost in adipocytes lacking UCP1 (Fig. 3c, d and 
Extended Data Fig. 3g, h). 

Our findings indicated that ROS-supported thermogenesis relied on 
both UCP1 and cysteine thiol oxidation. So, we considered whether 
UCP1 cysteines were targets for oxidative modification during ther- 
mogenesis. We used a redox gel-shift method’ to determine whether 
BAT mitochondrial ROS drive cysteine-centred oxidative modifi- 
cations on UCP1 in vivo. This approach labels reversibly oxidized 
protein cysteines with a polyethylene glycol maleimide, allowing for 
their determination by immunoblotting after SDS—polyacrylamide 
gel electrophoresis’® (Extended Data Fig. 3i). In BAT from mice 
housed at thermoneutrality, UCP1 existed predominantly in a cysteine 
non-oxidized form (Fig. 3e). However, upon cold challenge cysteine 
oxidation increased on UCP1 in a time-dependent manner (Fig. 3e). 
Calibration of UCP1 cysteine shifts indicated that a single cysteine site 
was redox modified during thermogenesis (Extended Data Fig. 4a—c), 
and this modification did not appear to participate in intermolecular 
disulfide bridging (Extended Data Fig. 4d). Notably, pharmacological 
reduction of BAT thiol status with NAC, which results in hypothermia 
(Fig. 2d), diminished UCP1 cysteine thiol oxidation upon cold 
exposure (Fig. 3f). 

So, ROS-supported thermogenesis in BAT both depended on UCP1 
(Fig. le) and acted directly on UCP1 cysteine(s) by oxidative modifi- 
cation (Fig. 3e, f). To examine the role of cysteine oxidation on UCP1 
function, we first identified the site modified during thermogenesis. 
We profiled UCP1 cysteines for sulfenic acid modifications in vivo by 
selective dimedone labelling and mass spectrometry’” (Extended Data 
Fig. 4e) after acute cold exposure. We identified six of seven UCP1 
cysteines, five of which were identified exclusively in the unmodified 
(N-ethyl maleimide (NEM)-labelled) form (Extended Data Fig. 4f and 
Supplementary Data File 1). However, UCP1 Cys253 was identified in 
the sulfenylated (dimedone-labelled) form (Fig. 4a, Extended Data Fig. 4f 
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Figure 2 | BAT mitochondrial ROS during thermogenesis drives 
oxidation of cellular and mitochondrial thiols. a, Distribution of 
percentage reversible oxidation status of BAT protein thiols + acute cold 
exposure. b, Pathway analysis of BAT proteins containing cysteine residues 
sensitive to substantial oxidation (>10% shift in oxidation status) upon 
cold exposure. Top: proteins clustered according to shared GO enrichment 
terms. Bottom: significantly enriched pathways. c, Immunodetection 

of protein sulfenic acid levels in BAT + acute cold exposure (n = 4) 

d, Effect of ip. NAC on core body temperature after acute cold exposure 
(n= 8; 500mgkg-' NAC n=7). e, Oxygen consumed 3h before and 

after i.p. CL+ NAC (control n= 12; NAC n=9). VCL, vinculin. Data 

are mean +s.e.m. of at least four mouse replicates. *P < 0.05, **P<0.01, 
*** P< 0.001 (two-tailed Student’s t-test for pairwise comparisons, 
one-way/two-way ANOVA for multiple comparisons involving one/two 
independent variables). 


and Supplementary Data File 1). Moreover, sulfenic acid occupancy 
on Cys253 increased significantly upon acute cold exposure (Fig. 4b). 
Quantification of dimedone-labelled Cys253 provided an estimate of 
sulfenylation status during cold exposure (8.4%; Fig. 4b). However, owing 
to the unstable nature of protein sulfenic acids!* this must be considered 
a significant under-representation of in vivo levels of this modification. 

Since UCP1 Cys253 was identified as a redox-sensitive site for 
which modification correlated with thermogenesis, we investigated 
the potential structural and functional importance of this residue. We 
modelled UCP1 on the structure of the homologous mitochondrial 
ATP/ADP carrier (AAC)!” (Fig. 4c and Extended Data Fig. 5a). This 
structure represents the matrix-closed ‘c-state’ of the carrier!?, which 
for UCP1 is maintained by inhibitory binding of cytosolic purine 
nucleotides!”°. 

Examining the environment of the Cys253 thiol indicated that it 
resides at an interface between two matrix-facing helices. These helices 
contain residues highly conserved across the carrier family that stabilize 
the ‘c-state’ by inter-helical hydrophobic interactions and a hydrogen 
bond?!’ (Fig. 4d and Extended Data Fig. 5b). This region forms the 
opposing face of the cytosolic cavity where inhibitory nucleotide 
binding is thought to occur**4, and is proximal to a putative site of 
cardiolipin binding on UCP1?! (Fig. 4d). Notably, modelling a sulfenic 
acid modification on the Cys253 thiol suggested that these ‘c-state’ 
interactions could be destabilized owing to local electrostatic effects 
(Extended Data Fig. 5b). Thus, on the basis of this structural model, 
Cys253 oxidation status could play a role in determining stability of 
the matrix closed ‘c-state’ of UCP1, and sensitivity to 6-adrenergic 
activation and uncoupling. 
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Figure 3 | BAT mitochondrial ROS oxidatively modify a cysteine 


residue on UCP1 and support UCP1-dependent leak respiration. 

a, b, OCR of brown adipocytes + noradrenaline (NE) 

stimulation + oligomycin to determine leak respiration + (a) MitoQ 
(n= 10) or (b) NAC (n=8; 0.1 mM NAC n=7). ¢, d, OCR of primary 
UCP1~/~ adipocytes + noradrenaline stimulation + oligomycin + 
(c) MitoQ (n= 10) or (d) NAC (n= 10). e, f, Cys-redox immunoblot of 
BAT UCP1 (e) after a time course of acute cold exposure, and (f) after 
acute cold exposure + 500 mgkg~' NAC. FCCP, carbonyl cyanide-p-trif 
luoromethoxyphenylhydrazone. Data are mean +s.e.m. of at least seven 
cellular replicates. *P < 0.05, **P < 0.01 (two-tailed Student’s t-test for 
pairwise comparisons, one-way ANOVA for multiple comparisons). 


In the cellular milieu, UCP1 is inhibited by endogenous purine 
nucleotides, and this inhibition can be overcome by 3-adrenergic 
stimulation of lipolysis and activation of UCP1-dependent 
respiration’. A prediction of the above model is that altering Cys253 
status should specifically modulate sensitivity to adrenergic activation 
of the purine nucleotide inhibited ‘c-state of UCP1!°. To test this, in 
UCP1 ~~ brown adipocytes we separately transduced WT UCP1 and 
seven mutants, each having one of the seven cysteine residues con- 
tained in the protein replaced with an alanine. We titrated UCP1 trans- 
duction to recapitulate RNA and protein abundance observed in WT 
brown adipocytes (Extended Data Figure 6a—d). In all cases, basal and 
maximal respiration was indistinguishable between UCP1~/~ brown 
adipocytes, UCP1 —I- adipocytes transduced with WT UCP1, and 
UCP1-‘~ adipocytes transduced with each of the seven UCP1 Cys- 
mutants (Extended Data Fig. 6e, f). This agreed with previous studies 
indicating that UCP1 cysteines are not critical for protein stability and 
mitochondrial viability”. 

The UCP1 Cys-mutant cells were subjected to noradrenaline and 
oligomycin to examine the effect of adrenergic activation on leak 
respiration. Noradrenaline stimulated rapid oxidation of protein 
cysteine residues (Extended Data Fig. 6g) that was concomitant 
with increases in OCR (Extended Data Fig. 6h). As expected, adipo- 
cytes lacking UCP1 exhibited a substantial reduction in respiration 
compared with WT UCP1 cells after 100nM noradrenaline and 
oligomycin treatment, with ~35% of total leak OCR determined to be 
UCP 1-dependent (Fig. 4e). Although replacing most UCP1 cysteines 
with alanine sites had no effect (Extended Data Fig. 7a—e), individual 
modification of Cys253 substantially inhibited UCP 1-dependent leak 
respiration (Fig. 4f). Interestingly, individual modification of Cys224 
also inhibited UCP1 leak respiration (Extended Data Fig. 7f). We 
additionally generated a C224A/C253A double mutant (Extended 
Data Fig. 6c, d), which exhibited compromise of UCP1-dependent 
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Figure 4 | UCP1 Cys253 is sulfenylated during thermogenesis and 
sensitizes UCP1 to adrenergic activation. a, MS’ spectrum of dimedone- 
labelled UCP1 Cys253 peptide indicating sulfenylation of this site during 
thermogenesis. Fragment ions that span the dimedone-alkylated cysteine 
are highlighted in the peptide sequence. C#, dimedone-labelled cysteine; 
M*, oxidized methionine. b, Quantification of dimedone-labelled UCP1 
Cys253 relative to the NEM-alkylated form (n=5). ¢, d, Structure of (c) 
human UCP1 modelled on the AAC crystal structure and (d) Cys253 

in a hydrophobic pocket between two matrix facing helices. IMS, 
intermembrane space. e, f, Basal, maximal, and UCP 1-dependent OCR of 
UCP1~/~ brown adipocytes + transduction with (e) WT UCP1 (WT 
n=7; UCP1-/~ n=6) or (f) UCP1 C253A (WT n= 17; C253A n= 19). 


respiration comparable to the individual mutations (Extended Data 
Fig. 7g). 

Replacement of the redox-sensitive Cys253 inhibited adrenergic 
activation of UCP1 by noradrenaline, so we next tested whether this 
site played an obligatory or sensitizing role in this context. To do so, we 
titrated noradrenaline (50-2,000 nM) to trigger variable levels of adr- 
energic stimulation in brown adipocytes (Extended Data Fig. 8a). All 
concentrations of noradrenaline were sufficient to activate WT UCP1- 
dependent leak respiration, which was consistently ~25-35% of total 
leak respiration (Fig. 4g and Extended Data Fig. 8b). UCP1-dependent 
respiration in C253A cells was substantially inhibited after stimula- 
tion with 50nM noradrenaline; however, this inhibition was over- 
come dose-dependently by increasing noradrenaline concentrations 
(Fig. 4g). Thus, increased adrenergic stimulus could overcome inhibi- 
tion by Cys253 replacement, suggesting that Cys253 plays a sensitizing 
(as opposed to an obligatory) role in adrenergically stimulated uncou- 
pled respiration by UCP1. In contrast, exposure to variable adrenergic 
stimulus did not substantially modify the degree of inhibition of UCP1- 
dependent leak respiration in UCP1 C224A cells (Extended Data 


KO, knockout. g, UCP1-dependent leak respiration after stimulation by 
various concentrations of noradrenaline (50nM noradrenaline n = 9; 
100 nM noradrenaline n =7; 500 nM noradrenaline n = 8; 2,000 nM 
noradrenaline n = 6). Comparison of UCP1 WT and C253A indicates 
that degree of UCP1 inhibition by C253A is inversely correlated with 
noradrenaline concentration (n = 19; 100nM WT noradrenaline; 

n= 17,500nM noradrenaline n = 18; 2,000 nM noradrenaline n= 10). 
j, A model of sensitization of UCP1-mediated uncoupling by 
mitochondrial ROS. Data are mean +s.e.m. of at least five mouse 
replicates or cell replicates for respirometry experiments. *P < 0.05, 
**P < 0.01 (two-tailed Student’s t-test for pairwise comparisons). 


Fig. 8c). So, as opposed to a specific effect through altering sensitivity 
to adrenergic activation as in the case of Cys253, Cys224 replacement 
may alter UCP1 structure/function more generally. 

Moreover, permeabilizing adipocytes to dilute cytosolic purine 
nucleotides resulted in uncoupled respiration that was indistinguish- 
able between WT and C253A cells (Extended Data Fig. 8d), and could 
be abrogated by sequestering of endogenous free fatty acids with BSA 
(Extended Data Fig. 8d). In this nucleotide-diluted state, WT and 
C253A UCPI cells exhibited a comparably low membrane potential 
(Az) as assessed by tetramethylrhodamine methyl ester (TMRM) 
fluorescence (Extended Data Figure 8e). As expected, titration of 
GDP resulted in an increase in Ay in UCP1 WT cells owing to GDP- 
dependent inhibition of UCP1!”° (Extended Data Fig. 8f). Notably, 
the C253A and C224A/C253A mutants exhibited increased sensitivity 
to GDP-dependent inhibition (Extended Data Fig. 8f). 

Taken together, these data indicate that acute regulation of ther- 
mogenesis in vivo can be supported by elevation of mitochondrial 
ROS levels. The thermogenic actions of mitochondrial ROS involve 
consequent oxidation of BAT cellular thiols, and UCP1—the major 


7 APRIL 2016 | VOL 532 | NATURE | 115 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


thermogenic effector—is a target of thiol oxidation. We identify UCP1 
Cys253 as a sulfenylation site that is increasingly modified during 
thermogenesis, and provide evidence that this residue can modulate 
sensitivity to activation of UCP1-dependent thermogenic respiration. 

While UCP1 is identified here as a target of redox modification dur- 
ing thermogenesis, it is likely that other functional target proteins exist, 
which may act together to support thermogenic respiration. In addi- 
tion to establishing a role for thiol redox signalling in thermogenesis 
in vivo, these findings provide insight into the longstanding investi- 
gations on the relationship between mitochondrial ROS and UCP1 
(refs 1, 2, 26, 27). Interestingly, studies in this area have relied largely 
on the use of isolated mitochondria and reconstituted systems. Our 
data suggest that redox regulation of UCP1-dependent respiration is 
particularly relevant in adipocytes and in vivo where both redox home- 
ostasis and free fatty-acid concentrations are subject to strict endoge- 
nous regulation; parameters that are necessarily divergent from in vitro 
conditions. Notably, a role for thiol redox status and UCP1-dependent 
uncoupling has been suggested in isolated BAT mitochondria”’, while 
the importance of superoxide in particular for modulating UCP1 func- 
tion is the subject of debate’”°?’, Previous investigation using a man- 
ganese superoxide dismutase transgenic mouse model suggested that 
matrix superoxide did not modify UCP1 function””. This previous work 
suggests that the role of thiol redox status in UCP1-dependent respira- 
tion detailed here is not directly mediated by matrix superoxide, but 
reliant on other ROS and related molecules the levels of which are not 
compromised by elevated manganese superoxide dismutase expression’®. 

Our data are consistent with a model whereby redox modification 
of Cys253 sensitizes purine nucleotide inhibited UCP1 to uncoupling 
due to acute adrenergic stimulus (Fig. 4j), which is compatible with 
fatty-acid-mediated activation and shuttling”®. This model reconciles 
the apparent sensitizing role of both redox status and UCP1 Cys253 
with the sufficient role of fatty acids for UCP 1-dependent uncoupling’, 
as well as previous observations that cysteines are not essential for 
UCP] activity in reconstituted systems’°. However, previous study 
of UCP1 cysteines used yeast expression systems that are subject to 
protein misfolding and artefactual uncoupling”. Although Cys224 
was not identified here as a target for reversible redox modification it 
appears to affect UCP 1-dependent respiration, albeit distinctly from 
the sensitizing role of Cys253. It could be that this site is modified by 
other ROS-dependent species that are known to interact with UCP1 
cysteines*®. Additionally, these sites may also be important for UCP1 
structure/function independent of redox-modification. Finally, it is 
worth noting that UCP1 Cys253 is conserved to a high degree across 
several mitochondrial carriers, including UCP2, UCP3, and AAC. 
This suggests that redox modification of this site may be a perva- 
sive signalling modality for mitochondrial carriers in physiology and 
pathophysiology. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Animal procedures and ethics statement. Animal experiments were performed 
according to procedures approved by the Institutional Animal Care and Use 
Committee of the Beth Israel Deaconess Medical Center. Unless otherwise stated, 
mice used were male C57BL/6] (8-12 weeks of age; Jackson Laboratories), and 
housed in a temperature-controlled (20-22°C) room ona 12h light/dark cycle. 
In vivo compound administration. All compounds administered to mice in vivo 
were injected at the stated dose i.p. 10 min before subsequent interventions unless 
otherwise stated. 

Body temperature and cold exposure. Body temperature and cold expo- 
sure experiments were assessed using a mouse rectal probe (World Precision 
Instruments). When studying acute activation of thermogenesis, mice were housed 
from birth at 20-22°C to allow for recruitment of thermogenic adipose tissue’. 
Before individual housing at 4°C, mice were placed at thermoneutrality (30°C) 
for 3 days which allows both for maintenance BAT UCP] protein content*! and 
for measurement of acute induction of BAT thermogenesis upon cold exposure. 
Upon exposure to 4°C, temperature was measured every 30 min. When studying 
body temperature after 4°C acclimation, WT and Ucp1~/~ mice (equal numbers of 
male and female mice in each group) were acclimated using established protocols: 
mice were individually housed for 1 week at 15°C, 1 week at 10°C, and 24h at 4°C 
before the experiment. 

EMG determination of muscle shivering. Mice were individually restrained to 
limit non-shivering muscle activity and two EMG needle electrodes were inserted 
subcutaneously above the nuchal muscles in the back of the neck. EMG leads 
were connected to a computerized data acquisition system via a communicator. 
EMG was recorded at thermoneutrality to determine non-shivering basal nuchal 
muscle activity, before placement of mice at 4°C. EMG data were collected and 
burst activity was determined as described previously*”. Briefly, EMG data were 
collected from the implanted electrodes at a sampling rate of 2kHz using LabChart 
8 Pro Software (ADInstruments). The raw signal was converted to root mean 
square activity. Root mean square activity was analysed for shivering bursts in 
10s windows. 

Metabolic phenotyping. Whole-body energy metabolism was evaluated using a 
Comprehensive Lab Animal Monitoring System (CLAMS, Columbia Instruments). 
For 6h measurements, mice were acclimated in the metabolic chambers for 48h 
before experiments to minimize stress from the housing change. CO; and O) levels 
were collected every 12 or 32 min for each mouse over the period of the experi- 
ment. For acute measurements, CO? and O; levels were collected every 10s. CL 
316,243 (Sigma-Aldrich; 1 mgkg~') was injected i.p. into mice at the indicated 
times. 

Assessment of mitochondrial aconitase inactivation in mouse BAT. Aconitase 
activity was measured as described previously**. In brief, after the relevant in vivo 
intervention mouse BAT was rapidly excised and homogenized in mitochondrial 
isolation buffer (250mM sucrose, 2mM EDTA, 10mM sodium citrate, 0.6 mM 
MnCl, 100 mM Tris-HCl, pH 7.4) followed by mitochondrial isolation by differ- 
ential centrifugation. Samples (1-2 mg mitochondrial protein) were added to a 
96-well plate and 190 ml assay buffer (50 mM Tris-HCl (pH 7.4), 0.6mM MnCh, 
5mM sodium citrate, 0.2mM NADP*, 0.1% (v/v) Triton X-100, 0.4U ml~! ICDH). 
Absorbance was measured at 340 nm for 7 min at 37°C. To control for mitochon- 
drial content aconitase activity was normalized to citrate synthase activity** and 
expressed the result as a percentage of control levels. 

Assessment of lipid hydroperoxides in mouse BAT. Lipid hydroperoxide content 
in mouse BAT was estimated by rapid snap freezing of BAT tissue followed by 
lipid extraction and assessment using a modified ferric thiocyanate assay (Cayman 
Chemical Lipid Hyroperoxide Assay Kit) according to the manufacturer’s 
instructions. 

Assessment of cysteine redox status by gel shift. Cysteine redox status of Prx3 
and UCP1 was measured as described previously'**». After the relevant in vivo 
intervention, mouse BAT was rapidly excised and homogenized in 100 mM NEM, 
1mM EGTA, 50mM Tris-HCl, pH 7.4. Samples were incubated at 37°C for 5 min 
before the addition of SDS (2% final) and further incubation at 37°C for 10 min. 
Incubations at 37 °C proceeded in a thermomixer at 1,300 r.p.m. Samples were 
then precipitated in five volumes of ice-cold acetone to remove excess NEM before 
resuspension in 1mM EGTA, 2% SDS, 10 mM TCEP, 50 mM Tris-HCl, pH 7.4 con- 
taining a polyelthylene glycol polymer conjugated to maleimide (50 mM PEG-Mal). 
Resuspended samples were incubated for 30 min at 37°C before a second acetone 
precipitation to remove excess PEG-Mal before sample resuspension and immuno- 
blot detection by standard methods described below. For UCP1 experiments, 
to ensure gel shift signals were specific to reversible cysteine oxidation, oxidized 
samples were separately treated with TCEP before differential labelling as described 
above. Calibrating the number of UCP! cysteines oxidized was achieved by treating 
TCEP-reduced samples with increasing proportions of Peg-Mal:NEM to generate a 
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cysteine-dependent ladder*. In addition, to ensure higher molecular mass signals 
were specific to UCP1, UCP1 antibody specificity was tested in BAT. It should 
be noted that while the UCP1 antibody used here is highly specific for UCP1 
in BAT (Extended Data Fig. 4c), the same antibody applied to cultured brown 
adipocyte samples can generate non-specific signals at molecular mass >35 kDa. 
So, the UCP1 gel shift assay as described here is only compatible with in vivo tissue 
experiments. 

Assessment of BAT-reduced and -oxidized glutathione content by mass 
spectrometry. Reduced and oxidized glutathione were profiled in negative ion- 
ization mode by liquid chromatography tandem mass spectrometry (LC-MS) 
methods as described previously**. Data were acquired using an ACQUITY 
UPLC (Waters) coupled to a 5500 QTRAP triple quadrupole mass spectrom- 
eter (AB SCIEX). Tissue homogenates (3011) were extracted using 12011 of 
80% methanol containing 0.05 ng jl! inosine-!Ny, 0.05 ng ul! thymine-dy, 
and 0.1 ngl~! glycocholate-dy as internal standards (Cambridge Isotope 
Laboratories). The samples were centrifuged (10 min, 9,000g, 4°C) and the 
supernatants (10 11) were injected directly onto a 150mm x 2.0mm Luna NH2 
column (Phenomenex). The column was eluted at a flow rate of 400 jl min7! 
with initial conditions of 10% mobile phase A (20mM ammonium acetate and 
20mM ammonium hydroxide (Sigma-Aldrich) in water (VWR)) and 90% mobile 
phase B (10mM ammonium hydroxide in 75:25 v/v acetonitrile/methanol 
(VWR)) followed by a 10 min linear gradient to 100% mobile phase A. The ion 
spray voltage was —4.5kV and the source temperature was 500°C. Raw data were 
processed using MultiQuant 2.1 software (AB SCIEX) for automated peak inte- 
gration. LC-MS data were processed and visually inspected using TraceFinder 
3.1 software (Thermo Fisher Scientific). 

Assessment of BAT protein thiol redox status using iodoTMT. After the relevant 
in vivo intervention, mouse BAT was rapidly excised and homogenized in 20% 
(w/v) TCA to stabilize thiols. The homogenate was incubated on ice for 30min and 
then pelleted for 30 min at 16,000g at 4°C. The pellet was washed with 10% and 
5% (w/v) TCA and then resuspended in 80 1] denaturing alkylating buffer (DAB; 
6M urea, 2% (w/v) SDS, 200 mM Tris-HCl, 10 mM EDTA, 100,.M DTPA, 101M 
neocuproine). The contents of one vial of iodo TMTs reagent (Thermo Scientific) 
was added to each of three biological replicate samples to label reduced cysteine 
residues at 37°C and 1,300 r.p.m. for 1h. Sample protein was precipitated with 
five volumes of ice-cold acetone, incubated at —20°C for 2h, and pelleted at 4°C 
and 16,000g for 30 min. The amount of protein to be processed was optimized 
to ensure saturation of thiol labelling by the iodoTMT reagent as per the 
manufacturer’s instructions. The pellet was washed twice with ice-cold acetone and 
then re-solubilized in 80,11 DAB containing 1 mM tris(2- carboxyethyl)phosphine 
(TCEP), reducing previously reversibly oxidised cysteine residues in the presence 
of a second, distinct iodo TMT, reagent. Proteins were incubated at 37°C and 1,400 
r.p.m. for 1h, precipitated and resuspended for protease digestion. After digestion, 
iodoTMT-labelled cysteine-containing peptides were enriched using the anti- TMT 
resin as per the manufacturer's instructions. 

Profiling of redox-sensitive protein targets. Proteins with cysteine thiols exhib- 
iting differential redox status (defined as > 10% shift in cysteine oxidation status 
upon cold exposure) were assessed for Gene Ontology (GO) term enrichment*”. 
The total identified population of cysteine thiol containing proteins was used as the 
reference background. Enriched GO terms were filtered after benjamini-hochberg 
correction at an adjusted P value <0.1. All data analysis used R (R Core Team, 
Vienna, Austria, http://www.R-project.org). 

Assessment of protein thiol sulfenic acids. Tissue or cellular samples were pre- 
pared adapting a protocol used previously to stabilize endogenous protein sulfenic 
acids**. Briefly, samples were homogenized in 50 mM Tris base, containing 100 mM 
NaCl, 100,1.M DTPA, 0.1% SDS, 0.5% sodium deoxycholate, 0.5% Triton-X 100, 
5mM dimedone. To minimize lysis-dependent oxidation, buffers were bubbled 
with argon before use. Samples were incubated for 15 min at room temperature, at 
which point SDS was added to a final concentration of 1% and samples were incu- 
bated for a further 15 min. After dimedone treatment, 10 mM TCEP and 50 mM 
NEM were added and samples were incubated for a further 15 min at 37°C to 
reduce and alkylate all non-sulfenic acid protein cysteine residues. Protein sulfenic 
acids were then assessed by immunoblotting against dimedone (1:1,000 antibody 
dilution). 

Targeted assessment of UCP1 cysteine sulfenylation status using dimedone-MS. 
After dimedone and NEM labelling of samples as described above, samples were 
resolved by SDS-PAGE and bands in the UCP1 containing region of the gel 
(30-35 kDa) were excised, destained with acetonitrile and subjected to dehydration 
by a speed vacuum concentrator. Gel bands were rehydrated with digestion buffer 
(7511 of 50mM HEPES and 500 ng of trypsin (Promega) and subjected to 12h 
of digestion at 37 °C. Peptides were extracted and labelled with TMT 10 reagents 
(Thermo Fisher) as previously described*’. 
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Protein digestion. Protein pellets were dried and resuspended in 8 M urea con- 
taining 50mM HEPES (pH 8.5). Protein concentrations were measured by BCA 
assay (Thermo Scientific) before protease digestion. Protein lysates were diluted 
to 4M urea and digested with LysC (Wako, Japan) in a 1/100 enzyme/protein ratio 
overnight. Protein extracts were diluted further to a 1.0 M urea concentration, and 
trypsin (Promega) was added to a final 1/200 enzyme/protein ratio for 6h at 37°C. 
Digests were acidified with 2011 of 20% formic acid (FA) to a pH ~2, and subjected 
to C18 solid-phase extraction (Sep-Pak, Waters). 

LC-MS/MS parameters. All spectra were acquired using an Orbitrap Fusion mass 
spectrometer (Thermo Fisher) in line with an Easy-nLC 1000 (Thermo Fisher 
Scientific) ultra-high pressure liquid chromatography pump. Peptides were sep- 
arated onto a 100\1M inner diameter column containing 1 cm of Magic C4 resin 
(51m, 100A, Michrom Bioresources) followed by 30cm of Sepax Technologies 
GP-C18 resin (1.8m, 120 A) witha gradient consisting of 9-30% (ACN, 0.125% 
FA) over 180 min at ~250nl min“. For all LC-MS/MS experiments, the mass 
spectrometer was operated in the data-dependent mode. We collected MS! 
spectra at a resolution of 120,000, with an AGC target of 150,000 and a maxi- 
mum injection time of 100 ms. The ten most intense ions were selected for MS” 
(excluding 1 Z-ions). MS! precursor ions were excluded using a dynamic window 
(75s+10ppm). The MS? precursors were isolated with a quadrupole mass filter 
set to a width of 0.5Th. For the MS? based TMT quantitation, MS? spectra were 
collected at an AGC of 4,000, maximum injection time of 150 ms, and CID collision 
energy of 35%. MS? spectra were acquired with the same Orbitrap parameters as 
the MS? method except HCD collision energy was increased to 55%. Synchronous- 
precursor-selection was enabled to include up to six MS? fragment ions for the 
MS? spectrum. 

Data processing and MS? spectra assignment. A compilation of in-house software 
was used to convert .raw files to mzXML format, as well as to adjust monoisotopic 
m/z measurements and erroneous peptide charge state assignments. Assignment 
of MS? spectra was performed using the SEQUEST algorithm”. All experiments 
used the Mouse UniProt database (downloaded 10 April 2014) where reversed 
protein sequences and known contaminants such as human keratins were 
appended. SEQUEST searches were performed using a 20 ppm precursor ion 
tolerance, while requiring each peptide’s amino/carboxy (N/C) terminus to have 
trypsin protease specificity and allowing up to two missed cleavages. lodoTMT 
tags on cysteine residues residues (+329.226595 Da) was set as static modifica- 
tions, while methionine oxidation (+15.99492 Da) was set as variable modifi- 
cations. For targeted assessment of UCP1 cysteine sulfenylation, TMT tags on 
lysine residues and peptide N termini (+229.16293 Da), NEM on cysteine residues 
(+125.047679 Da) were set as static modifications and oxidation of methionine 
residues (+15.99492 Da) and dimedone on cysteine residues (+13.020401 Da 
versus NEM) as variable modifications. Determination of sulfenylation status of 
the Cys253 peptide was determined by comparing TMT reporter ion abundance 
of the dimedone-alkylated and NEM-alkylated peptides as a proportion of total 
precursor ion intensity. An MS? spectra assignment false discovery rate of less than 
1% was achieved by applying the target-decoy database search strategy*’. Protein 
filtering was performed using an in-house linear discrimination analysis algorithm 
to create one combined filter parameter from the following peptide ion and MS2 
spectra metrics: XCorr, ACn score, peptide ion mass accuracy, peptide length and 
missed-cleavages™. Linear discrimination scores were used to assign probabilities 
to each MS? spectrum for being assigned correctly, and these probabilities were 
further used to filter the data set to a 1% protein-level false discovery rate. 
Determination of iodoTMT reporter ion intensities and quantitative data 
analysis. For quantification, a 0.03m/z window centred on the theoretical Th 
value of each reporter ion was used for the nearest signal intensity. Reporter ion 
intensities were adjusted to correct for the isotopic impurities from the different 
TMT reagents (manufacturer specifications). The signal to noise values for all 
peptides were summed within each TMT channel. For each peptide, a total mini- 
mum sum signal to noise value of 200 and an isolation purity greater than 70% was 
required*’. Percentage cysteine oxidation status of protein thiols was calculated as 
the percentage of the cysteine containing peptide (total or mitochondrial) labelled 
with iodoTMT (129, 130, 131) for each condition over the sum of the reduced 
peptide labelled with iodoTMT (126, 127, 128) plus reversibly oxidized labelled 
peptide (129, 130, 131): (oxidized peptide 129, 130, 131)/(reduced peptide 126, 
127, 128 + oxidized peptide 129, 130, 131) x 100. 

Primary brown adipocyte preparation and differentiation. Interscapular 
brown adipose stromal vascular fraction was obtained from 2- to 6-day-old pups 
as described previously“. Interscapular brown adipose was dissected, washed in 
PBS, minced, and digested for 45 min at 37°C in PBS containing 1.5 mg ml colla- 
genase B, 123 mM NaCl, 5mM KCl, 1.3mM CaCh, 5mM glucose, 100 mM HEPES, 
and 4% essentially fatty-acid-free BSA. Tissue suspension was filtered through a 
40 jm cell strainer and centrifuged at 600g for 5 min to pellet the SVF. The cell 


pellet was resuspended in adipocyte culture medium and plated. Primary brown 
pre-adipocytes were counted and plated in the evening, 12h before differentia- 
tion at 15,000 cells per well of a seahorse plate. Pre-adipocyte plating was scaled 
according to surface area. The following morning, brown pre-adipocytes were 
induced to differentiate for 2 days with an adipogenic cocktail (11M rosiglitazone, 
0.5mM IBMX, 51M dexamethasone, 0.114 ,.g ml insulin, 1 nM T3, and 125 1M 
Indomethacin) in adipocyte culture medium. Two days after induction, cells were 
re-fed every 48h with adipocyte culture medium containing 11M rosiglitazone 
and 0.5.g ml! insulin. Cells were fully differentiated by day 5 after induction. 
Primary brown adipocyte cellular respirometry. Cellular OCR of primary brown 
adipocytes was determined using a Seahorse XF24 Extracellular Flux Analyzer. 
Adipocytes were plated and differentiated in XF24 V7 cell culture microplates. 
Before analysis adipocyte culture medium was changed to DMEM respiration 
medium lacking NaHCO; (Sigma), and including 1.85 g1~! NaCl, 3mg1~' phenol 
red, 2% fatty-acid-free BSA, 1mM sodium pyruvate, pH 7.4. Basal respiration 
was determined to be the OCR in the presence of substrate alone. ATP-synthase- 
independent respiration was determined after addition of 2.5|1M oligomycin. 
Unless otherwise stated, leak respiration was determined after addition of 2.5 1M 
oligomycin and 100 nM noradrenaline. Maximal respiration was determined after 
addition of 2 1M FCCP. To determine OCR after plasma membrane permeabiliza- 
tion, cells were treated with 501g ml! saponin, and sequestration of free fatty acids 
after permeabilization was achieved through addition of 2% fatty-acid-free BSA. 
Cloning, construct design, and site-directed mutagenesis. RNA from murine 
BAT was reverse-transcribed and used as template for PCR of Ucp1. Sequences 
for Ucp1 amplification were as follows: sense, CAC CAT GGT GAA CCC GAC 
AAC TTC CG; antisense, TTA TGT GGT ACA ATC CAC TG. PCR fragments 
were gel-purified and cloned into the pENTR/D-TOPO entry vector according 
to the manufacturer’s instructions (Invitrogen; K2400). Cloned Ucp1 was shut- 
tled into the pAd/CMV/V5-DEST Gateway vector, and confirmed by sequencing. 
Cysteine mutants were generated using the Quik-Change site-directed mutagenesis 
kit (Stratagene). Primers for generating mutants were as follows: Ucp1 C24A 
forward 5'-AGCCGGAGTTTCAGCTGCCCTGGCAGATATCATC-3’, reverse 
5’-GATGATATCTGCCAGGGCAGCTGAAACTCCGGCT-3’; Ucp1 C188A 
forward 5'-TGAGAAATGTCATCATCAATGCTACAGAGCTGGTAACATATG-3’, 
reverse 5‘-CATATGTTACCAGCTCTGTAGCATTGATGATGACATTTCTCA-3; 
UCP1 C213A forward 5'-TGGCAGATGACGTCCCCGCCCATTTACT 
GTCAGCTC-3’, reverse 5’-GAGCTGACAGTAAATGGGCGGGGACG 
TCATCTGCCA-3’; Ucp1 C224A forward 5’-TCTTGTTGCCGGGTT 
TGCCACCACACTCCTGGCC-3’, reverse 5'/-GGCCAGGAGTGTGGTG 
GCAAACCCGGCAACAAGA-3’; Ucp1 C253A forward 5’-CCCAAGC 
GTACCAAGCGCTGCGATGTCCATGTAC-3’, reverse 5‘-GTACATGGAC 
ATCGCAGCGCTTGGTACGCTTGGG-3’; Ucp1 C287A forward 5‘-GGAAC 
GTCATCATGTTTGTGGCCTTTGAACAGCTGAAAAAAG-3’, reverse 
5/-CTTTTTTCAGCTGT TCAAAGGCCACAAACATGATGACGTTCC-3’; Ucp1 
C304A forward 5‘-CAGACAGACAGTGGATGCTACCACATAAGGATCC-3’, 
reverse 5’-GGATCCT TATGTGGTAGCATCCACTGTCTGTCTG-3’. 
Production of adenovirus and transduction of cysteine-null UCP1 mutants. 
pAd/CMV/V5-DEST/Ucp1 was linearized with Pacl and transfected (31g) into 
293A cells with lipofectamine 2000 (Invitrogen). Crude adenovirus was gener- 
ated according to the manufacturer’s instructions (Invitrogen; V493-20). Crude 
adenovirus was amplified by infecting 293A cells, and purified using the Fast 
Trap Adenovirus Purification and Concentration Kit (EMD Millipore). Virus 
was quantified by examining viral DNA. Briefly, viral particles were treated with 
Proteinase K and DNA was isolated with phenol and chloroform/isoamylalcohol 
(24:1). Preliminary experiments with titrations of viral transductions in Ucp1~/~ 
adipocytes were used to determine the amount of virus yielding a Ucp1 messenger 
RNA (mRNA) and protein level similar to the level detected from Ucp1*'* adipo- 
cytes. For subsequent experiments, primary brown adipocytes were transduced 
with purified adenovirus in the evening of day 3 after differentiation with medium 
replacement the following morning. Adipocytes were used for experiments on 
day 5 after differentiation. 

Structural modelling of UCP1. A comparative model of UCP1 was built by using 
the structure of the bovine AAC”. This structure corresponds to the ‘c-state’ of 
the carrier—open to the mitochondrial inner membrane. The protein sequence 
of human UCP1 was taken from UniProt. To align the AAC and UCP1 sequences, 
MUSCLE* and manual editing in Jalview*® were used. To improve the quality 
of the comparative models, the alignments were edited to remove the N- and 
C-terminal residues of the UCP1 sequences that did not align with resolved 
residues in the AAC structure, and to place gaps in the UCP1 sequences so as 
to minimize the distance between these residues in the initial target structure. 
Fifty comparative models of human UCP1 were built from the AAC structure 
and the sequence alignment by using MODELLER. The structure with the lowest 
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MODELLER energy score was taken as the best representative structure. The 
cardiolipin molecules of the AAC were added to the modelled UCP1 structure by 
aligning the two structures, and copying the lipid molecules*"”””. This structure 
was examined and figures produced by using the PYMOL molecular visualization 
system (PyMOL Molecular Graphics System, version 1.4.1, Schrédinger). 
Assessment of superoxide levels in primary brown adipocytes. ROS produc- 
tion was estimated by oxidation of DHE and ratiometric assessment as described 
previously**. Cells were plated and differentiated onto 96-well plates suitable for 
fluorescence analysis. Before imaging, cell media was removed and replaced with 
imaging buffer (156 mM NaCl, 1.25mM KH>PO,, 3mM KCl, 2mM MgCh, 10mM 
HEPES, pH 7.4) supplemented with 1 mM sodium pyruvate. Cells were loaded with 
5\1M DHE (Invitrogen), which remained present throughout the time course. DHE 
was excited at 355nm and the emitted signal was acquired at 460 nm. Oxidized 
DHE was excited at 544nm and emission was acquired at 590 nm. 

Assessment of mitochondrial membrane potential in permeabilized primary 
brown adipocytes. Mitochondrial membrane potential was measured in perme- 
abilized cells using TMRM (Life Technologies) in dequench mode. In this mode, 
mitochondrial depolarization causes redistribution of a high concentration of 
signal quenched TMRM from mitochondria to the cytosol, such that the lower 
concentration results in dequenching and an increase in fluorescence“. Cells were 
pre-loaded at room temperature with imaging buffer containing 1 4M TMRM. 
TMRM fluorescence was excited at 544 nm and emission was collected at 590 nm. 
Quantitative RT-PCR. Total RNA was extracted from frozen tissue using 
TRIzol (Invitrogen), purified with RNeasy Mini spin columns (QIAGEN) and 
reverse transcribed using a High-Capacity cDNA Reverse Transcription kit 
(Applied Biosystems). The resultant complementary DNA (cDNA) was ana- 
lysed by quantitative PCR with reverse transcription (qRT-PCR). Briefly, 20 ng 
cDNA and 150 nmol of each primer were mixed with SYBR GreenER qPCR 
SuperMix (Applied Biosystems). Reactions were performed in a 384-well format 
using an ABI PRISM 7900HT real time PCR system (Applied Biosystems). 
Relative mRNA levels were calculated using the comparative CT method 
and normalized to cyclophilin mRNA. The following primers were used in 
these studies: Cyclophilin forward 5’-GGAGATGGCACAGGAGGAA-3’, 
reverse 5'‘-GCCCGTAGTGCTTCAGCTT-3’; Ucp1 forward 5/- 
ACTGCCACACCTCCAGTCATT-3’, reverse 5’/-CTTTGCCTCACTCAG 
GATTGG-3’; Dio2 forward 5‘-CAGTGTGGTGCACGTCTCCAATC-3’, 
reverse 5‘-TGAACCAAAGTTGACCACCAG-3’; Pgcla forward 
5'!-CCCTGCCATTGTTAAGACC-3’, reverse 5/-TGCTGCTGTTCCTGTTTTC-3’; 
PPAR-y forward 5’-TGAAAGAAGCGGTGAACCACTG-3’, reverse 
5'-TGGCATCTCTGTGTCAACCATG-3’; Pgc18 forward 5’-CTGACGT 
GGACGAGCTTTCA-3’, reverse 5’-CGTCCTTCAGAGCGTCAGAG-3’; 
Nrf2 forward 5’-CCAGCTACTCCCAGGTTGCC-3’, reverse 5’-GGGA 
TATCCAGGGCAAGCGA-3’; Ap2 5’-AAGGTGAAGAGCATCATAACCCT-3’, 
reverse 5/-TCACGCCTTTCATAACACATTCC-3’. 

Glycerol release. Adipocytes were incubated in respiration medium absent BSA 
and treated with indicated concentrations of noradrenaline for 2h before collec- 
tion of medium and quantification of glycerol using free glycerol reagent (Sigma- 
Aldrich) relative to glycerol standard and normalized to protein content. 
Immunoblot analysis. Immunodetection after SDS-PAGE used the following 
antibodies: UCP1 (Abcam ab10983), Prx3 (Abcam ab16751), Dimedone (Millipore 
07-2139), Vinculin (Sigma V9264), ATP5A and NDUFB8 (Abcam ab110413), 
ATGL (CST 2138), ATGL pS406 (Abcam ab135093), HSL (CST 4107), HSL pS660 
(CST 4126), pPKA substrate (CST 9624 s), PPAR-y (CST 2435S). 
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Statistical analyses. Data were expressed as mean +s.e.m. and P values were 
calculated using two-tailed Student's t-test for pairwise comparisons, one-way 
ANOVA for multiple comparisons, and two-way ANOVA for multiple compari- 
sons involving two independent variables. ANOVA analyses were subjected to 
Bonferroni’s post hoc test. Sample sizes were determined on the basis of previous 
experiments using similar methodologies. For in vivo studies, mice were ran- 
domly assigned to treatment groups. Mass spectrometric analyses were blinded 
to experimental conditions. 
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Extended Data Figure 1 | Assessing superoxide in brown adipocytes 
and effect of mitochondria-targeted compounds on shivering, body 
temperature, and movement. a, Noradrenaline stimulates superoxide- 
dependent oxidation of DHE in primary brown adipocytes (n=5). 

b, Effect of i-p. decyl-TPP on core body temperature after acute cold 
exposure (n= 10). c, d, Representative (c) raw and (d) root mean 
square mouse EMG traces at thermoneutrality and after acute cold 
exposure + MitoQ, NAC, or curare (0.3 mg kg). e, Quantification of 
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t MitoQ or NAC (n=3). f, Absolute oxygen 
consumed immediately before acute CL treatment described in Fig. 1f 
(n=5). g, Effect of ip. CL + MitoQ on movement as assessed by number 
of beam breaks (n = 8). NS, not significant. Data are mean £s.e.m. of 

at least three replicates. *P < 0.05, **P< 0.01, ***P < 0.001 (two-tailed 
Student's t-test for pairwise comparisons, one-way ANOVA for multiple 
comparisons, two-way ANOVA for multiple comparisons involving two 
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Extended Data Figure 2 | Assessing thiol redox status in vivo and the 
effect of NAC on movement. a, Mass spectrometric quantification of 
BAT-reduced and -oxidized glutathione at thermoneutrality and after 
acute cold exposure (n =5). b, Scheme for quantitative assessment of 
protein thiol redox status by ratiometric labelling of BAT protein cysteine 
thiols. In vivo BAT thiol status is stabilized by protein precipitation in 
20% TCA”. Unmodified cysteine thiols are labelled with ‘light’ iodoTMT 
tags (126, 127, 128). After removal of light iodoTMT, reversibly modified 
protein thiols are reduced with TCEP in the presence of ‘heavy’ iodo TMT 
(129, 130, 131). Samples are combined and subjected to trypsin digestion. 


Ratiometric assessment of iodoTMT labelled peptides provides a 
quantitative profile of overall protein cysteine thiol redox status. 

c, Average percentage oxidation status of total BAT and BAT mitochondrial 
protein thiols at thermoneutrality and after acute cold exposure (n = 3). 

d, Effect of ip. NAC on movement as assessed by number of beam breaks 
(vehicle, n= 11; NAC, n=7). Data are mean +s.e.m. of at least three 
replicates. *P < 0.05, ***P < 0.001 (two-tailed Student’s t-test for pairwise 
comparisons, one-way ANOVA for multiple comparisons, two-way 
ANOVA for multiple comparisons involving two independent variables). 
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Extended Data Figure 3 | Assessing thermogenic gene expression, 
adrenergic response parameters, and strategy for determination of 
UCP1 cysteine thiol redox status. a, b, Quantitative polymerase chain 
reaction (qPCR) analysis of mRNA expression of selected (a) BAT and 
(b) inguinal white adipose tissue (WAT) genes + cold exposure, + MitoQ or 
NAC (n=5). ¢, d, Immunoblot analysis of (c) PPAR-y protein expression 
levels and (d) lipolytic phosphorylation cascade activation in BAT after 
cold exposure + MitoQ. e, f, Raw OCR of primary brown adipocytes 
under basal conditions and after noradrenaline stimulation + oligomycin 
to determine leak respiration + (e) MitoQ (n= 10) or (f) NAC (vehicle 
and 1mM NAC, n=8; 0.1mM NAC, n=7). g, h, OCR of primary brown 


adipocytes lacking UCP1 under basal conditions and after noradrenaline 
stimulation + oligomycin + (g) MitoQ (n= 10) and (h) NAC (n= 10). 

i, Cys-redox mass shift strategy. After in vivo interventions, mouse BAT 
is excised and unmodified protein thiols are labelled with NEM, after 
which reversibly oxidized thiols are chemically reduced and labelled with 
polyethyleneglycol maleimide (PEG-Mal) allowing for determination of 
cysteine oxidation status on UCP1. Data are mean +s.e.m. of at least five 
replicates. *P < 0.05, **P < 0.01 (two-tailed Student's t-test for pairwise 
comparisons, one-way ANOVA for multiple comparisons, two-way 
ANOVA for multiple comparisons involving two independent variables). 
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Extended Data Figure 4 | Assessing UCP1 reversible cysteine oxidation 
status in vivo by immunoblot and mass spectrometry. a, Calibration of 
UCP! cysteine gel shift immunoblot. Calibration of cysteine-dependent 
shifts by incubation of BAT protein with TCEP and different ratios of 
NEM and PEG-Mal indicates that a single PEG-mal labelling event shifts 
UCP1 by ~10kDa above the native molecular mass. b, Calibration of 
UCP1 cysteine oxidation status indicates that the gel shift observed upon 
cold exposure (lane 1) is cysteine dependent, as TCEP pretreatment 
results in a loss of the shift (lane 2). In addition, the cysteine-dependent 
mass shift is due to a single oxidation event as determined by including 
the calibrating markers (lanes 4-6). c, Calibration of specificity of 

UCP1 antibody in BAT. d, Reducing and non-reducing SDS-PAGE 
analysis of UCP1 to monitor cysteine-dependent and -independent 
inter-protein interactions during acute cold exposure. e, Scheme for 
identification of sulfenylated cysteines on UCP1 by dimedone labelling 
and mass spectrometry. After acute cold exposure, BAT protein thiols are 
differentially alkylated with dimedone to selectively label sulfenylated 


thiols and NEM to label non-sulfenylated thiols. Samples are subjected to 
trypsin digestion, followed by Lys-TMT labelling, and MS quantification 
of UCP1 cysteine containing peptides in their dimedone and NEM 
alkylated forms. Two technical points should be noted in this strategy 
when interpreting relative quantitation of NEM and dimedone-alkylated 
peptides. First, these differently alkylated peptides may not necessarily 
ionize with the same efficiency. Second, NEM is reported to react with 
sulfenic acids albeit less efficiently than with free thiols*°, which should 
be factored in when considering the order of addition of dimedone/ 
NEM and potential underestimation of sulfenylation status. f, Top: 
amino-acid sequence alignment of UCP1 proteins highlighting the 
candidate cysteine residues contained within the mouse protein and their 
level of conservation across various species. Bottom: summary of MS 
determination of UCP1 cysteine sulfenylation status. Six of seven UCP1 
cysteines were identified, with all but one being identified exclusively in 
the unmodified (NEM-alkylated state). Cys253 is identified as dimedone 
labelled, indicating that it is a site for sulfenylation. 
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Extended Data Figure 5 | Structure of human UCP1 modelled on the residues (pink) surround the Cys253 thiol, and a hydrogen bond between 
AAC crystal structure including bound cardiolipin and sulfenylation of | Arg238 and Glu261 (aqua) is proximal. These residues that stabilize 
Cys253. a, Entire UCP1 modelled structure including bound cardiolipin interaction between the matrix facing helices are probably important 
(green), and Cys253 in the oxidized sulfenic acid form. b, UCP1 region for stabilization of the purine bound ‘c-state’ of the carrier. Two separate 
containing Cys253 in the oxidized sulfenic acid form. Cys253 localizes to cardiolipin (green) binding domains are localized proximal to Cys253 

a hydrophobic pocket between two matrix facing helices. Hydrophobic within the UCP1 modelled structure. 
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Extended Data Figure 6 | Assessing transduced UCP1 constructs,OCR, forms relative to WT across separate transduction experiments (n = 4; 
and sulfenylation in brown adipocytes. a, Quantitative PCR analysis of C224A/C253A n= 3). e, f, Summary of basal (e) and maximal (f) OCR of 


UCP1 mRNA in WT, and UCP1~/~ brown adipocytes transduced with primary brown adipocytes containing cysteine-null UCP1 mutants. Raw 
WT and cysteine null UCP1 mutants (n= 4). b, Immunoblot of UCP1 OCR values provided in Extended Data Fig. 7. g, Immunodetection of 
protein in WT and UCP1~’~ brown adipocytes transduced with WT and protein sulfenic acid levels in primary brown adipocytes in the seconds 
cysteine null UCP1 mutants. c, Immunoblot analysis of UCP1 protein after treatment with 100 nM noradrenaline. h, Time course of brown 

in UCP1~/~ brown adipocytes transduced with C224A/C253A double adipocyte OCR after stimulation with 100 nM noradrenaline (m= 12). 


mutant compared with WT. d, Densitometry analysis of transduced UCP1 Data are mean +s.e.m. of at least three replicates. 
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Extended Data Figure 7 | Assessing OCR under basal and (WT n= 11; C24A n= 12), (b) C188A (n=7), (c) UCP1 C213A (n= 19), 


FCCP-stimulated maximal respiration, and after noradrenaline (d) UCP1 C287A (WT n=9; C287A n= 10), (e) UCP1 C304A (WT n=7; 
stimulation + oligomycin in UCP1~'~ primary brown adipocytes C304A n= 10), (f) UCP1 C224A (WT n=9; C224A n= 10), (g) UCP1 
transduced with UCP1 cysteine null mutants. a—g, Raw basal, maximal, C224A/C253A (n= 8). Data are mean + s.e.m. of at least seven replicates. 
and UCP1-dependent OCR from representative experiments of WT **P < 0.01 (two-tailed Student’s t-test for pairwise comparisons). 

and UCP1~'~ brown adipocytes transduced with (a) UCP1 C24A 
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Extended Data Figure 8 | Assessing UCP 1-dependent respiration 

and uncoupling following increasing degrees of adrenergic stimulus. 
a, Glycerol release from brown adipocytes as an index of adrenergic 
stimulus and lipolysis in response to increasing concentrations 

of noradrenaline (n = 4; 0 and 2,000 nM noradrenaline n=6). 

b, Representative raw noradrenaline + oligomycin leak OCR values 

in WT and UCP1-/~ brown adipocytes after stimulation with a range 

of noradrenaline concentrations indicates that UCP1-dependent leak 
respiration is consistently ~25-35% of total leak OCR (50nM n=9; 
100nM n=7; 500nM n=8; 2,000nM n=6). c, Assessment of 

UCP 1-dependent leak respiration after stimulation by various 
concentrations of noradrenaline + oligomycin. Comparison of WT and 
UCP1~'~ OCR (replotted data from Fig. 4 for comparison) indicates that 
UCP1-dependent respiration is consistently ~25-35% of leak respiration. 


Comparison of UCP1 WT and C224A indicates that the degree of UCP1 
inhibition by C224<A is relatively stable across various noradrenaline 
concentrations (n = 8; C224A 100nM noradrenaline n = 9, 2,000 nM 
noradrenaline n= 14). d, Plasma-membrane-permeabilized OCR of 
brown adipocytes + endogenous fatty-acid depletion with BSA (WT 

n= 30; KO n= 20; C224A n=9; C253A n= 8; C224A/C253A n= 10). 

e, Comparison of UCP1-dependent uncoupling absent purine nucleotide 
inhibition in plasma-membrane-permeabilized brown adipocytes (n = 6; 
C224A/C253A, KO n=4). f, Comparison of UCP1-dependent uncoupling 
after titration of GDP in permeabilized adipocytes containing WT UCP1, 
UCP1 C224A, C253A, and C224A/C253A (n=6; C224A/C253A n= 12). 
Data are mean +s.e.m. of at least four replicates. *P < 0.05, **P<0.01, 
***P < ().001 (two-tailed Student's t-test for pairwise comparisons, 
one-way ANOVA for multiple comparisons). 
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Lypd8 promotes the segregation of flagellated 
microbiota and colonic epithelia 


Ryu Okumura!, Takashi Kurakawa!, Takashi Nakano*, Hisako Kayama!?, Makoto Kinoshita), Daisuke Motooka‘, 
Kazuyoshi Gotoh*”, Taishi Kimura!, Naganori Kamiyama!, Takashi Kusu', Yoshiyasu Ueda!, Hong Wu, Hideki lijima®, 
Soumik Barman!*, Hideki Osawa’, Hiroshi Matsuno’, Junichi Nishimura’, Yusuke Ohba®, Shota Nakamura‘, Tetsuya lida*”, 
Masahiro Yamamoto”, Eiji Umemoto!?, Koichi Sano* & Kiyoshi Takeda! 


Colonic epithelial cells are covered by thick inner and outer mucus 
layers’. The inner mucus layer is free of commensal microbiota, 
which contributes to the maintenance of gut homeostasis*®. In 
the small intestine, molecules critical for prevention of bacterial 
invasion into epithelia such as Paneth-cell-derived anti-microbial 
peptides and regenerating islet-derived 3 (RegIII) family proteins 
have been identified”"!!. Although there are mucus layers providing 
physical barriers against the large number of microbiota present 
in the large intestine, the mechanisms that separate bacteria and 
colonic epithelia are not fully elucidated. Here we show that Ly6/ 
PLAUR domain containing 8 (Lypd8) protein prevents flagellated 
microbiota invading the colonic epithelia in mice. Lypd8, 
selectively expressed in epithelial cells at the uppermost layer of 
the large intestinal gland, was secreted into the lumen and bound 
flagellated bacteria including Proteus mirabilis. In the absence of 
Lypd8, bacteria were present in the inner mucus layer and many 
flagellated bacteria invaded epithelia. Lypd8~/~ mice were highly 
sensitive to intestinal inflammation induced by dextran sulfate 
sodium (DSS). Antibiotic elimination of Gram-negative flagellated 
bacteria restored the bacterial-free state of the inner mucus layer 
and ameliorated DSS-induced intestinal inflammation in Lypd8~/— 
mice. Lypd8 bound to flagella and suppressed motility of flagellated 
bacteria. Thus, Lypd8 mediates segregation of intestinal bacteria 
and epithelial cells in the colon to preserve intestinal homeostasis. 

The intestinal mucosa is protected from commensal microbes and 
pathogenic microorganisms by several types of barriers®®, including 
mucus layers produced by intestinal secretory cells’. Compared with 
those in the small intestine, epithelial cells in the large intestine are 
covered by thick mucus composed of two layers: the inner firm mucus 
layer and outer loose mucus layer. Abundant numbers of commensal 
microbiota are present in the outer mucus layer, whereas none is 
present in the inner mucus layer**. The firm inner mucus layer acts 
as a physical barrier, but it is possible that other mechanisms for 
maintaining the segregation of bacteria and colonic epithelia also exist. 
To address this issue, we searched for a gene that is exclusively expressed 
in the large intestine in a public database (http://biogps.org/#goto= 
welcome), and found that Lypd8 was selectively expressed in epithelial 
cells of the mouse gastrointestinal tract, especially the caecum and 
large intestine (Fig. 1a, b). Lypd8 was expressed in germ-free mice at 
a similar level to that in specific-pathogen-free (SPF) mice (Extended 
Data Fig. 1a). Lypd8 was expressed at normal levels in Myd88~/~ and 
Asc~!~ (also known as Pycard) mice (Extended Data Fig. 1b), indicating 


that Lypd8 expression is not induced by microbiota-dependent 
inflammatory signals. We observed high Lypd8 mRNA expression 
in situ in the uppermost layer of the colonic gland exposed to the lumen 
of the large intestine (Fig. 1c), and when introduced into a polarized 
human colorectal carcinoma Caco-2 cell line, Lypd8 was expressed at 
the apical surface (Extended Data Fig. 1c). Thus, Lypd8 is expressed 
on the apical surface of epithelial cells located at the uppermost layer 
of the colonic gland. 

On the basis of amino acid length, the expected molecular weight 
of Lypd8 protein is about 25 kilodaltons (kDa). However, western 
blot analysis showed a band for recombinant Lypd8 protein that was 
greater than 100 kDa. Lypd8 carries 13 asparagine (Asn) residues that 
are potential N-glycosylation sites. We therefore treated recombinant 
Lypd8 protein with peptide-N-glycosidase F (PNGase F), an amidase 
that cleaves between the innermost N-acetylglucosamine and Asn 
residues of the N-linked glycoproteins. Treatment with PNGase F 
substantially reduced the molecular mass of Lypd8 proteins (Fig. 1d). 
Treatment with sialidase A gave a more diffuse band, as sialic acids 
were removed, and the addition of O-glycanse did not further alter the 
mobility, showing that Lypd8 did not contain any Gal$(1-3)GalNAc- 
O-glycans. Replacing all of the Asn residues with Asp resulted in the 
appearance of a 25 kDa Lypd8 protein (Extended Data Fig. 1d), indi- 
cating that Lypd8 is a highly N-glycosylated protein. 

Lypd8 is a member of the Ly6/PLAUR family of proteins, many of 
which are glycophosphatidylinositol (GPI)-anchored!”!?. We treated 
a mouse colon cancer CMT93 cell line stably expressing Flag-tagged 
Lypd8 with phosphatidylinositol-specific phospholipase C (PI-PLC), 
which cleaves and releases GPI-anchored proteins from the cell mem- 
brane, and found that surface Lypd8 expression was severely reduced 
after PI-PLC treatment (Extended Data Fig. le). We then examined 
whether Lypd8 is secreted into the intestinal lumen using immuno- 
histochemistry with a newly established anti-Lypd8 monoclonal anti- 
body (mAb) and Carnoy’s fixative, which preserves the mucus layer? 
(Fig. le). Positive staining for Lypd8 was observed in epithelial cells at 
the uppermost layer of the colonic gland, and a positive signal was also 
observed in the intestinal lumen. In particular, an intense Lypd8 signal 
was observed at the interface of inner and outer mucus layers that are 
exposed to commensal bacteria. In addition, analyses using Lypd8°"™"* 
mice, in which modified yellow fluorescent protein is expressed under 
the control of the Lypd8 promoter, showed that Lypd8-expressing cells 
also expressed Villin, but not Muc2 or Lgr5 (Extended Data Fig. 2). 
Thus, Lypd8 is a GPI-anchored highly glycosylated protein expressed in 
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Figure 1 | Molecular characteristics of Lypd8. a, b, Quantitative reverse 
transcription (RT)—PCR of Lypd8 expression in various organs (a), and in 
the epithelia and lamina propria of the colon (b). Values were normalized 
to Gapdh expression. Data represent the mean + s.d. (n =4 per group). 
#*P < 0,01, ***P< 0.005, ****P< 0,001, NS, not significant. c, In situ 
hybridization of colon sections with anti-sense and sense Lypd8 

RNA probes. Scale bars, 50,1m. d, Western blot of recombinant 
Flag-tagged Lypd8 proteins untreated or treated with sialidase A, PNGase 
F or O-glycanase. e, Immunostaining with anti-Lypd8 antibody (green), 
FISH using an EUB338 probe (red), and 4’,6-diamidino-2-phenylindole 
(DAPI; blue) of Carnoy’s fixed colon sections. Scale bar, 50 jum. 

f, Immunostaining (upper panel) with anti-Lypd8 antibody (red) and 
DAPI (blue) and H&E staining (lower panel) of human colon sections 
from the normal mucosa of a colon cancer patient and the inflamed 
mucosa of two ulcerative colitis patients. Scale bar, 100 1m. 


enterocytes located at the uppermost epithelial layer of the colon, which 
is cleaved and secreted into the colonic lumen. In healthy human intes- 
tinal sections, LYPD8 was preferentially and highly expressed on the 
epithelial layer exposed to the lumen (Fig. 1f); however, LYPD8 stain- 
ing was severely decreased in patients with ulcerative colitis (Fig. 1f), 
indicating the possible involvement of this protein in its pathogenesis. 

We next analysed the effect of the Lypd8 deficiency (Extended Data 
Fig. 3). Lypd8~/~ mice were healthy when reared in an SPF facility 
in which bacterial contamination was strictly controlled. Histological 
analysis of haematoxylin and eosin (H&E)-stained colonic tissue 
showed a region that was free of luminal contents above the epithelial 
cell layer in wild-type mice, but was colonized by bacteria in Lypd3~/~ 
mice (Fig. 2a). We examined the presence of intestinal bacteria adja- 
cent to the epithelial layer in the large intestine by fluorescence in situ 
hybridization (FISH) using a universal bacterial 16S ribosomal RNA 
(rRNA) probe (Fig. 2b, c). This area was free from bacteria in wild-type 
mice, but bacteria were detected in close proximity to the epithelial lay- 
ers in Lypd8~'~ mice despite normal mucin production, as confirmed 
by anti-Muc2 antibody staining (Fig. 2d). Furthermore, there was no 
positive signal in the crypts of wild-type mice, but bacteria invaded the 
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Figure 2 | Penetration of the colonic epithelia by flagellated bactiera in 
Lypd8~'~ mice. a, H&E staining of Carnoy’s fixed colon sections. Scale 
bars, 100|1m. b, FISH using EUB338 probe (red) and DAPI (blue) of the 
Carnoy’s fixed colon. Scale bars, 20|1m. c, Extent of bacterial exclusion 
from the tip of epithelia to luminal bacteria in proximal, middle, and 
distal colon. (1 =9 wild-type (WT) mice; n= 10 Lypd8~/~ mice). 

d, Immunostaining with anti-Muc2 antibody (green), EUB338 (red) and 
DAPI (blue) of Carnoy’s fixed colon sections. Scale bars, 201m. e, EUB338 
(red) and DAPI (blue) in the crypt region of the colon. Scale bars, 201m. 
f, Quantitative PCR of bacterial DNA in 1 mg of colon tissue, using 
primers specific for each bacterial genus. Data show each bacterial DNA 
amount compared to wild-type mice group (n=6 per group). g, Bacterium 
forming the swarming colony (arrows). h, Colony-forming units (CFU) of 
P. mirabilis in colon tissues. i, FISH using Proteus-specific probe (green) 
and DAPI (blue) in the colon. j, Immunohistochemistry analysis of colon 
sections at the indicated time points after inoculation of CFSE-labelled 

P. mirabilis. P. mirabilis (green) and DAPI (blue). The representative of 
three independent experiments is shown. Scale bars, 201m. k, CFU of 

P. mirabilis in colon tissues at 8 h after the inoculation. Data are 

mean + s.d. *P< 0.05, **P< 0.01, ****P< 0.001. 


crypts of Lypd8&/~ mice (Fig. 2e). Thus, intestinal bacteria penetrated 
the inner mucus layer and further into the crypts in the large intestine 
of Lypd8~/~ mice. 

The bacterial composition of colonic luminal contents analysed by 
16S rRNA gene sequencing was not altered in Lypd8~/~ mice (Extended 
Data Fig. 4a). Therefore, we isolated DNA from the luminal content and 
colonic tissues and analysed for the presence of bacteria by PCR using 
universal primers for bacterial 16S rRNA (Extended Data Fig. 4b). 
Bacterial DNA was similarly amplified in luminal contents from wild- 
type and Lypd8~!~ mice. In wild-type colonic tissues, minimal DNA 
was detected. In contrast, bacterial DNA was substantially amplified 
in colonic tissues from Lypd8~'~ mice. We then identified the bac- 
terial species invading Lypd8‘~ epithelial cell layers using bacterial 
genus-specific primers (Fig. 2f). The numbers of several bacterial gen- 
era were increased in Lypd8/~ colonic tissues, particularly Proteus, 
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Figure 3 | High sensitivity to intestinal inflammation in Lypd8—'~ mice. 
a-c, Percentage of weight change after 2% DSS administration of wild-type 
(n=10) and Lypd8"'~ mice (n = 10) (a). H&E staining of colon tissues 

at day 8 of DSS administration (b). Scale bars, 501m. Survival rate (c). 

d, Quantitative PCR of bacterial DNA isolated from 1 mg of faeces from 
untreated, gentamicin (GM)-treated and vancomycin (VCM)-treated 


Helicobacter and Escherichia. Next, homogenized colonic tissues from 
wild-type and Lypd8~'~ mice were cultured on agar plates (Fig. 2g). 
Several bacterial colonies grew from wild-type and Lypd8'~ colonic 
tissues. Among them, unique colonies with a swarming migration 
pattern on agar were present in the Lypd8 ‘~ colonic tissue homogen- 
ates. DNA sequence analysis revealed that the bacterium forming the 
unusual swarming colony derived from Lypd8~‘~ tissues was Proteus 
mirabilis (Fig. 2h). We analysed the presence of Proteus spp. in colonic 
tissues by FISH using a Proteus-specific 16S rRNA probe (Fig. 2i). 
Proteus spp. were detected in the luminal regions of wild-type mice, but 
slightly increased numbers of Proteus spp. were observed in the luminal 
regions of Lypd8 '~ mice. Furthermore, Proteus spp. were present just 
above the epithelial cell layer and in the crypts of Lypd8~'~ mice. We 
administered a transanal injection of carboxyfluorescein succinimidyl 
ester (CFSE)-labelled P mirabilis to wild-type and Lypd8~'~ mice, and 
analysed the colonic tissues by immunohistochemistry at various time 
points (Fig. 2). In wild-type mice, P mirabilis was not observed adja- 
cent to epithelial cell layers, but was in close proximity to the epithelial 
cell layers of Lypd8~'~ mice 2h after administration. Furthermore, 
P. mirabilis was present in closer proximity to the epithelial layers of 
Lypd8 '~ mice at 8h (Fig. 2k). Despite the penetration of the epithelial 
surface by commensal bacteria, an inflammatory response (as deter- 
mined by inflammatory cytokine expression) was not evident in the 
colon of Lypd8~/~ mice reared in an SPF facility (Extended Data Fig. 5). 
Thus, intestinal bacteria such as Proteus, Helicobacter and Escherichia, 
all of which are Gram-negative rod bacteria possessing multiple fla- 
gella, penetrated the inner mucus layer and were present adjacent to 
the epithelial cell layers of Lypd8~/~ mice. 

Proteus and Helicobacter are associated with the pathogene- 
sis of inflammatory bowel diseases in both mice and humans’*"!°. 
Therefore, we analysed the sensitivity of Lypd8~'~ mice to intestinal 


Extent of bacterial 


GM vcm (¢) 


ie) GM VCM 


Wild type Lypds- 


mice. Data show the bacterial DNA amounts compared to untreated 
wild-type mice group (n =5 per group). ND, not detected. e, FISH using 
EUB338 probe (red) and DAPI (blue) in the colon of Lypd8~'~ mice 
untreated or orally treated with GM or VCM. Scale bars, 201m. f, Extent 
of bacterial exclusion from the tip of epithelia to luminal bacteria (n =7 
per group). Data are mean + s.d. *P < 0.05, ***P < 0.005, ****P < 0.001. 


inflammation. Administration of DSS to Lypd8~'~ mice induced severe 
intestinal inflammation with high mortality (Fig. 3a—c). Mice were 
then orally treated with gentamicin, which is active against Gram- 
negative bacteria including P mirabilis (Extended Data Fig. 6a). The 
faeces of Lypd8~'~ mice treated with gentamicin had decreased num- 
bers of Proteus, Escherichia and Helicobacter (Fig. 3d), and, accordingly, 
no bacteria were present just above the epithelial layers (Fig. 3e, f). 
Furthermore, gentamicin treatment ameliorated the intestinal inflam- 
mation in DSS-treated Lypd8~/~ mice (Extended Data Fig. 6b, d, f). 
In contrast, treatment with vancomycin, which is active against 
Gram-positive cocci but not P mirabilis (Extended Data Fig. 6a), did 
not induce changes in the inner layers of Lypd8~'~ mice (Fig. 3e, f). 
Furthermore, vancomycin treatment resulted in increased numbers 
of faecal Lactobacillus, Proteus, Escherichia and Helicobacter (Fig. 3d) 
and worsened DSS-induced intestinal inflammation in Lypd8~'~ mice 
(Extended Data Fig. 6c, e, g). Thus, mucus penetration by flagellated 
Gram-negative bacteria such as Proteus, Escherichia and Helicobacter 
correlates with increased sensitivity to intestinal inflammation in 
Lypd8~'~ mice. 

We next addressed the mechanisms by which Lypd8 inhibits bacte- 
rial invasion of the epithelial layer. Recombinant Lypd8 was incubated 
with faecal samples from wild-type mice (Fig. 4a). Lypd8 bound to 
a subpopulation of faecal bacteria. Then, faecal suspensions of wild- 
type and Lypd8~'~ mice were incubated with anti-Lypd8 mAb and 
analysed by flow cytometry and fluorescence microscopy (Fig. 4b, c). 
Lypd8* bacteria were present in faeces from wild-type mice. Next, we 
purified Lypd8-bound and unbound faecal bacteria by flow cytometry, 
and determined the bacterial genera present (Fig. 4d). Escherichia 
and Helicobacter were substantially enriched in Lypd8-bound bac- 
teria, indicating that Lypd8 preferentially associates with these flag- 
ellated bacteria. Flow cytometric analysis further showed that Lypd8 
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Figure 4 | Lypd8-dependent suppression of P. mirabilis attachment 
to epithelial cells. a, Flow cytometry of faecal bacteria incubated with 
Lypd8. b, c, Flow cytometry (b) and confocal microscopy (c) of faecal 
bacteria incubated with anti-Lypd8 antibody. Faecal bacterium, blue; 
Lypd8, red. Scale bar, 1 j1m. d, Quantitative PCR of Lypd8-bound and 
unbound bacteria. Data show the fold enrichment of DNA amount of 
Lypd8-bound versus Lypd8-unbound bacteria (n = 8 per group). ND, not 
detected in unbound fraction. e, The percentages of anti-Flag-antibody- 
bound bacteria incubated with Flag-tagged Lypd8. f, g, Immunoblot 
analysis with anti-Flag antibody of the supernatant and pellet of the 
mixture of Flag-tagged Lypd8 and flagella or bacterial bodies from 

P. mirabilis (f) and E.coli (g). h, i, ELISA assay of Lypd8 binding to 


bound to pure preparations of in-vitro-cultured P. mirabilis, but not 
non-flagellated bacteria such as Bacteroides sartrii, Lactobacillus 
acidophilus, Bifidobacterium breve and Enterococcus gallinarum (Fig. 4e 
and Extended Data Fig. 7a). However, Lypd8 exhibited no bactericidal 
activity against P mirabilis (Extended Data Fig. 7b). P. mirabilis was 
incubated with recombinant Lypd8 and analysed by immunogold scan- 
ning electron microscopy (Extended Data Fig. 7c). Immunogold-like 
spheres were observed on flagella of P. mirabilis treated with Lypd8. 
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flagellin (h) or flagella (i) of P. mirabilis. j,k, The mean CFU of 

E. gallinarum or P. mirabilis adhering to Caco-2 cells (n= 6) (j) and Caco-2 
cells pretreated with PNGaseF (k). Data are mean + s.d. and representative 
of three independent experiments. l-n, Schema of swarming motility 
assay of P. mirabilis (1). Circular swarms by P. mirabilis in the presence 

of polarized Lypd8-expressing Caco-2 cells at 13h. *, centre; +, edges of 
circular swarms. Representative photos (m) and the mean length of the 
circular swarms at the indicated time points (m= 5) (n). 0, p, Motility of 

P. mirabilis in semisolid agar with or without Lypd8. Representative photos 
(o) and the radii of motility halos at 4h (p). Data are mean + s.d. (n=6 
per group). *P< 0.05, **P < 0.01, ***P < 0.005, ****P < 0.001. 


Therefore, flagella and bacterial bodies were isolated from P. mirabilis 
(Extended Data Fig. 8a, b), incubated with a solution containing 
recombinant Lypd8, and pelleted down (Fig. 4f). After incubation 
with increasing flagella concentrations, the amount of recombinant 
Lypd8 was decreased in the supernatants, but elevated in the pellets. 
The incubation with bacterial bodies did not alter the amount of Lypd8. 
Similar observations were made when flagella and bacterial bodies were 
isolated from commensal E. coli (Fig. 4g). When a protein structurally 
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related to Lypd8 (LOC69864, encoded by 1810065E05Rik) was used, 
the protein amounts were not altered by incubation with flagella of 
P mirabilis, indicating that Lypd8 binds specifically to bacterial flagella 
(Extended Data Fig. 8c). The main component of flagella is flagellin. 
Therefore, we performed a quantitative enzyme-linked immuno- 
sorbent assay (ELISA) to analyse the association of Lypd8 both with 
flagellin and flagella (Fig. 4h, i and Extended Data Fig. 8d, e). Unlike 
TLRS, Lypd8 did not bind to flagellin, but associated with flagella in a 
dose-dependent manner. As Lypd8 interacts with flagella of live bacte- 
ria (Fig. 4a—e, Extended Data Fig. 7a), Lypd8 is suggested to recognize 
the higher order structure of the flagellum comprised of polymerized 
flagellin. As Caco-2 cells did not express endogenous LYPD8 (Extended 
Data Fig. 9a), we used Caco-2 cells with enforced expression of mouse 
Lypd8 for further analyses. P. mirabilis was added to cultures of polar- 
ized Caco-2 cells with or without Lypd8 expression, and after 8h the 
bacteria that had attached to Caco-2 cells were analysed (Fig. 4j). 
In Caco-2 cells expressing Lypd8, attachment of P mirabilis, but not 
E. gallinarum, was markedly diminished. PNGase F treatment of 
Caco-2 cells expressing Lypd8 increased the number of P. mirabilis, 
indicating the possible involvement of glycan in the inhibitory activity 
of Lypd8 (Fig. 4k). We added in-vitro-cultured P. mirabilis to the centre 
of lysogeny broth agar, then laid the polarized Caco-2 cell sheet with 
or without Lypd8 over the agar, and incubated for 5, 9 or 13h (Fig. 41). 
Swarming activity of P. mirabilis was severely impaired in the presence 
of Lypd8 (Fig. 4m, n). Because Lypd8 was secreted into culture media 
by Lypd8-expressing Caco-2 cells (Extended Data Fig. 9b), we inves- 
tigated whether secreted Lypd8 was able to inhibit bacterial motility. 
In-vitro-cultured P. mirabilis was added to the centre of semisolid agar 
containing recombinant Lypd§8, and incubated for 4 h (Fig. 40, p). In 
the presence of Lypd8, motility of P mirabilis was reduced. In addition 
to P mirabilis, commensal E. coli migrated into semisolid agar, but its 
migratory activity was inhibited in the presence of recombinant Lypd8 
(Extended Data Fig. 10). These findings indicate that secreted Lypd8 
binds to flagella and thereby inhibits bacterial motility in the intestinal 
lumen. 

Our results indicate that Lypd8 promotes the segregation of flagel- 
lated bacteria and colonic epithelia, thus reducing the risk of intestinal 
inflammation. The decreased LYPD8 expression seen in ulcerative 
colitis patients indicates that LYPD8 is involved in the pathogenesis 
of inflammatory bowel diseases. Whilst TLR5 is known to recognize 
flagellin and prevent intestinal inflammation by stabilizing micro- 
biota'”!8, Lypd8 recognizes and binds with flagella (polymerized flagel- 
lins) of live bacteria, reducing their motility and thereby regulating 
gut homeostasis. The functional association of Lypd8 and TLRS is an 
interesting future issue to be addressed. DSS has been shown to make 
the inner mucus layer permeable to bacteria‘. In the absence of Lypd8, 
the number of flagellated commensal bacteria, which invade the epithe- 
lia, increased after DSS treatment resulting in the severe inflammation. 
As enteric pathogenic bacteria harbour flagella, Lypd8 may also have 
a role in prevention of enteric infection. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 
Mice. C57BL/6] mice were purchased from Japan SLC (Shizuoka), and ICR mice 
and germ-free (IQI/Jic[Gf] ICR) mice were from CLEA Japan (Tokyo). Myd8s~! - 
and Asc~/~ mice backcrossed onto C57BL/6 mice for eight or more generations 
were used!?*!. All mice were kept under SPF conditions at the Experimental 
Animal Facility, Graduate School of Medicine, Osaka University. All animal experi- 
ments were performed according to guidelines of the Animal Research Committee 
of the Graduate School of Medicine at Osaka University. 
Quantitative RT-PCR. RNA samples were prepared from indicated tis- 
sues using TRIzol reagent (Invitrogen). Total RNA was reverse transcribed 
using Moloney murine leukaemia virus reverse transcriptase (Promega) and 
random primers (Toyobo) after treatment with RQ1 DNase I (Promega). 
cDNA was analysed by real-time RT-PCR using GoTaq qPCR Master Mix 
(Promega) in ABI 7300 (Applied Biosystems). Values were normalized to 
the expression of Gapdh, and the fold difference in expression relative to 
that of Gapdh is shown. The following primer sets were used: Lypd8, 5'-GC 
CTTCACTGTCCATCTATTT-3/ and 5’-GTGACCATAGCAAGACATGCA-3’; 
Villin, 5'-CTATGCAGATGGTACCTGTTC-3’ and 5’-CCTGGGA 
CGAGTCCTGGCCAA-3’; Muc2, 5’-ACATCACCTGTCCCGACTTC-3’ and 
5'-GAGCAAGGGACTCTGGTCTG-3’; Lgr5, 5‘-CATCACACTGTCACTG 
TGAGC-3’ and 5’‘-GGTAGCTGACTGATGTTGTTC-3’; Tnfa, 5’-TCCA 
GGCGGTGCCTATGT-3’ and 5’-CACCCCGAAGTTCAGTAGACAGA-3’; 
Il1b, 5'-TCAGGCAGGCAGTATCACTCA-3’ and 5’-GGAAGGTCC 
ACGGGAAAGAC-3’; Ifng, 5'- TCAAGTGGCATAGATGTGGAAGAA-3’ 
and 5'-TGGCTCTGCAGGATTTTCATG-3’; 16, 5'-CTGCAAGAGA 
CTTCCATCCAGTT-3’ and 5’-AAGTAGGGAAGGCCGTGGTT-3’; Cx3cl1, 
5'-GGCCGCGTTCTTCCATTTGT-3’ and 5’-TGATAGCGGATGAGCAAA 
GC-3'; Cxcl2, 5’-CTCAGTGCTGCACTGGTCCTG-3’ and 5’-CTGGGG 
GCGTCACACTCAAGC-3’; Ccl17, 5'-CTGCAGCATGCCAGAGCT-3’ and 
5!-GGTCTTATACCAGCTCAC-3’; Cel28, 5‘-CAGCCTCACCTGAGTCATTGC-3’ 
and 5‘-CAGTGCAACAGCTGGAGGCCA-3’; Gapdh, 5'-CCTCGTCCCG 
TAGACAAAATG-3/ and 5‘-TCTCCACTTTGCCACTGCAA-3’, 
In situ hybridization. Plasmids (pCRII; Invitrogen) containing a cDNA fragment 
of Lypd8 were used as templates of RNA probes. Digoxigenin (DIG)-labelled anti- 
sense or sense probes were prepared with T7 and SP6 RNA polymerase (Ambion), 
respectively, using the DIG RNA Labelling Mix (Roche Diagnostics). Colonic 
tissues of C57BL/6J were fixed with 4% paraformaldehyde (PFA). Serial frozen 
sections were fixed in 4% PFA for 20 min, incubated in cold 0.1% H2O:, and per- 
meabilized with 50,.g ml! proteinase K for 5 min. After an additional fixation 
with PFA, the sections were treated with acetic anhydrite in triethanolamine for 
5 min. The sections were then pre-hybridized with 50% formamide, 5 x saline 
sodium citrate, 1 mg ml”! yeast tRNA (Roche Diagnostics), 100 1g ml“! hepa- 
rin, 1 x Denhardt’s solution, 0.1% Tween 20 at 60°C for 3 h, and hybridized at 
60°C for 16h. After washing, sections were incubated with horseradish peroxide 
(HRP)-anti-DIG Fab fragment (clone 1.71.256: Roche Diagnostics), followed by 
biotin-labelled tyramide (TSA Biotin System; Perkin Elmer) for signal amplifi- 
cation. Hybridized probes were detected by ABC-Alkaline phosphatase (Vector 
Laboratories) and NBT/BCIP (Roche Diagnostics). Images were obtained using 
BZ-9000 (Keyence). 
Generation of Lypd8’"’ mice and Lypd8~'~ mice. Targeting vectors were con- 
structed by replacement of genomic fragment containing the third and fourth 
exons of Lypd8 with a Cre recombinase internal ribosome entry site Venus neo- 
mycin-resistance gene cassette (Lypd8’°*™’ mice) or a neomycin-resistance gene 
cassette (Lypd8~'~ mice), and a gene encoding HSV thymidine kinase driven by a 
phosphoglycerate kinase promoter inserted into the genomic fragment. After the 
targeting vector was transfected into V6.5 embryonic stem cells, G418 and ganci- 
clovir double-resistant colonies were selected and screened by PCR and Southern 
blot analysis. Homologous recombinants were used for generation of Lypd8*"" 
mice and Lypd8~/~ mice. Lypd8’*""’ mice and Lypd8~'~ mice were backcrossed 
onto C57BL/6 mice for at least six generations, and Lypd8"*™’ mice, Lypd8~'~ mice 
and their wild-type littermates from intercrosses of heterozygous mice were kept 
in the same cages and used for experiments. There was no randomization, but 
stratification was used to achieve the similar ages and sex rations among exper- 
imental groups. The experiments were not blinded. Flow cytometry of isolated 
colonic epithelial cells 

Intestinal epithelial cells were isolated from Lypd8’*""’ mice by shaking intes- 
tinal tissues in 5mM EDTA/HBSS solution at 37°C for 20 min and then washed 
with phosphate buffered saline (PBS). The intestinal epithelial cells were treated 
with PE-Cy7-conjugated anti-CD3e Ab (clone 145-2C11: BD Biosciences) in PBS 
containing 2% FBS to block nonspecific binding. Flow cytometric analysis was 
performed using a FACSCanto II flow cytometer (BD Biosciences) with FlowJo 
software (Tree Star). For cell isolation, cells were sorted using a FACSAria (BD 
Biosciences). 


= 


Generation of colonic epithelial cell lines stably expressing Lypd8. A sequence 
for Flag-tagged Lypd8 was constructed by inserting Flag-tag sequence into total 
Lypd8 coding sequence immediately downstream of predicted N-terminal sig- 
nal sequence. CMT-93 cells and Caco-2 cells, which were originally obtained 
from ATCC and free of mycoplasma, were transfected with linearized pcDNA3.1 
(+) vector (Invitrogen) inserted the sequence for Flag-tagged Lypd8 using 
Lipofectamine2000 (Invitrogen). These cells were cultured in G418-containing 
medium. The surviving cells were stained with anti-Flag M2 monoclonal antibody 
(cat F3165: Sigma-Aldrich) and Alexa Fluor 488 goat anti-mouse IgG antibody (cat 
A11001: Molecular Probes) and cells expressing Flag-tagged Lypd8 were sorted 
using FACSAria (BD Biosciences). 

Immunostaining of Caco-2 cell line cultured on transwell membrane. Caco-2 
cells stably expressing Flag-tagged Lypd8 (1 x 10° cells) were cultured on 
transwell filters of 3.0-\1m pore size (BD Biosciences) for 2 weeks. After con- 
firming full confluency, Caco-2 cells were washed with PBS, fixed with 4% 
PFA and then blocked by 1% bovine serum albumin (BSA) in PBS. Cells were 
stained with mouse anti-Flag M2 mAb (Sigma-Aldrich) plus Alexa Fluor 
488 goat anti-mouse IgG (Invitrogen) or rabbit anti-human Claudin-1 Ab 
(cat ab15098; Abcam) plus Alexa Fluor 594 goat anti-rabbit IgG (cat A11012; 
Invitrogen), and counterstained with DAPI (Vector Laboratories). Transwell 
filters with these cells were cut, placed on slide glasses and analysed using a con- 
focal microscope (FV1000-D; Olympus). The supernatants on Caco-2 cells with 
or without Lypd8 expression were incubated with anti-Flag M2 affinity gel (Sigma- 
Aldrich) at 4°C for 3h. The resin was collected, washed three times with Tris- 
buffered saline and then suspended in sample buffer for immunoblot analysis 
using anti-Flag M2 mAb (Sigma-Aldrich) and HRP-conjugated goat anti-mouse 
IgG (cat NA931: GE Healthcare). 

Assay for GPI cleaving activity. CMT93 cells stably expressing Lypd8 (1 x 10° 
cells) were rinsed twice with cold PBS and incubated with 0.5 ml of the same buffer 
containing 0.5 units of Bacillus cereus phosphatidylinositol-specific phospholi- 
pase C (Invitrogen) at 4°C for 20 min. These cells were stained with anti-Flag M2 
mAb (Sigma-Aldrich) and Alexa Fluor 488 goat anti-mouse IgG (Invitrogen). The 
surface expression of Lypd8 was analysed using a FACSCanto II (BD Biosciences). 
Generation of anti-Lypd8 antibody. Lypd8~/~ mice were immunized with P3U1 
cells with retroviral overexpression of Lypd8. Lymph node cells from immunized 
Lypd8~'~ mice were fused with Sp2/0 mouse myeloma cells. Hybridomas were 
screened by analysing reactivity to RBL1 cells overexpressing Flag-tagged Lypd8 
with flow cytometry. Positive clones were selected for further subcloning. Culture 
supernatants of subcloned hybridoma cells were screened by immunostaining of 
4% PFA- or Carnoy’s-fixed colon sections from wild-type and Lypd8~'~ mice. 
Culture supernatants of hybridoma clones (clone number 4F8 and 5A4), which 
stained Carnoy’s-fixed colon sections of wild-type mice, but not Lypds“! ~ mice, 
were used as mouse anti-mouse Lypd8 mAb. The specificity of mAb was also 
confirmed by flow cytometry analysis of CMT93 cells with or without Lypd8 
expression. Anti-mouse Lypd8 mAb was purified from the supernatant by Ex-pure 
Spin ProG (Kyoto monotech) and labelled with CF633 using Mix-n-Stain CF633 
antibody labelling kit (Biotium). 

Immunostaining and fluorescence in situ hybridization. Colons from 8- to 
12-week-old mice without washing were fixed in methanol-Carnoy’s fixative com- 
posed of 60% methanol, 30% chloroform and 10% acetic acid. Paraffin-embedded 
sections were dewaxed and hydrated. Sections were blocked with 1% BSA in PBS 
and stained with CF633-conjugated anti-mouse Lypd8 mAb or anti-Mucin2 Ab 
(clone H-300: Santa Cruz Biotechnology) and Alexa Fluor 594 goat anti-rabbit 
IgG (cat A11012: Invitrogen). Sections were incubated with 1 j1g Cy3- or Cy5- 
conjugated EUB338 (5'-GCTGCCTCCCGTAGGAGT-3’) for detection of all bac- 
teria or Cy3-conjugated pB-02110 (5/-ATGGGTTCATCCCATAGTGC-3/)"* for 
detection of Proteus spp. in 20011 of hybridization buffer (750mM NaCl, 100 mM 
Tris-HCl (pH 7.4), 5mM EDTA, 0.01% BSA, 10% dextran sulfate) at 40°C for 
16h. Sections were rinsed in wash buffer (50 mM NaCl, 4mM Tris-HCl (pH 7.4), 
0.02 mM EDTA), washed at 45°C for 20 min and counterstained with DAPI (Vector 
Laboratories). Human colons from the normal mucosa of resected colon tissues 
from colon cancer patients with no history of a diagnosis of inflammatory bowel 
diseases or the inflamed mucosa of resected colon tissues from ulcerative coli- 
tis patients were fixed in 4% PFA. The experiment was approved by the Ethical 
Committee of Osaka University School of Medicine. All participants provided 
informed consent. All ulcerative patients had a confirmed diagnosis by a gastro- 
enterologist. Paraffin-embedded sections were dewaxed and hydrated. Sections 
were blocked with 1% BSA in PBS and stained with anti-human Lypd8 Ab (clone 
V-16: Santa Cruz Biotechnology) and Alexa Fluor 568 donkey anti-goat IgG (cat 
A11057: Invitrogen). Normal goat IgG (cat AB-108-C: R&D systems) was used as 
an isotype control. Sections were analysed using a confocal microscope (FV1000-D; 
Olympus). The distance between bacterial populations and the epithelial surface 
was measured at four points of the proximal, middle and distal colon in each mouse. 
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Purification of Flag-tagged recombinant proteins. Recombinant Lypd8 or 
LOC69864 (an uncharacterized protein that is structurally most similar to 
Lypd8) was purified from CMT93 cells stably expressing FLAG-tagged Lypd8 or 
LOC69864 using FLAG M Purification Kit (Sigma-Aldrich). As a negative control, 
cells transfected with empty vector (pcDNA3.1 (+) (Invitrogen)) were used. 
Deglycosylation assay. Recombinant Lypd8 proteins (11g) were incubated with 
PNGase £, sialidase A and O-glycanase (Prozyme) at 37°C for 3 h. Recombinant 
Lypd8 proteins treated with glycanase were separated with SDS-PAGE and trans- 
ferred to polyvinylidene fluoride membranes (Millipore) that were incubated with 
anti-Flag M2 mAb (Sigma-Aldrich) and then HRP-conjugated goat anti-mouse IgG 
(cat NA931: GE Healthcare). Immunoreactivity was detected using SuperSignal 
(Thermo Scientific). 

Analysis of mutant Lypd8 (N-D) protein. The mutant Lypd8 N-D sequence 
was designed so that thirteen Asp (N) residues were converted to Asn (D). 293T 
cells were transfected with the mutant Lypd8 (N-D) expression vector using 
Lipofectamine 2000 (Invitrogen) and cell lysates of 293T cells expressing mutant 
Lypd8 (N-D) protein were separated with SDS-PAGE and analysed by western 
blot. 

Extraction of bacterial DNA. Faeces, luminal contents of the colon or colonic 
tissues were collected in tubes containing RNAlater (Ambion). After weights 
were measured, RNAlater was added to make tenfold dilutions of homogenates. 
Homogenates (200 11) of faeces, luminal contents or colonic tissues, or bacterial 
suspension containing sorted bacteria were washed twice with 1 ml PBS, 0.3 g glass 
beads (diameter, 0.1 mm) (BioSpec Products), 300 l Tris-SDS solution and 500 ul 
TE-saturated phenol were added to the suspension, and the mixture was vortexed 
vigorously using a FastPrep-24 (M.P. Biomedicals) at 5.0 power level for 30s. After 
centrifugation at 20,000g for 5 min at 4°C, 40011 of supernatants were collected. 
Subsequently, phenol-chloroform extraction was performed and 2501] of super- 
natants were subjected to isopropanol precipitation. Finally, DNAs were suspended 
in 200l TE buffer and stored at —20°C. 

Determination of microbiota by deep sequencing. PCR was performed 
using a primer set (784F, 5‘-AGGATTAGATACCCTGGTA-3’; and 1061R, 
5'-CRRCACGAGCTGACGAC-3’) targeting the V5-V6 region of the 16S rRNA 
genes with KAPA HiFi HotStart Ready Mix (KAPA Biosystems). Products were 
purified using DNA clean and Concentrator-5 (Zymo Research). Adaptor and 
barcode sequences were attached to the products by 10 cycles of PCR with 1 ng 
of each of initial PCR product as the template and primer sets and other PCR 
conditions were unchanged. Sequencing was performed using a 316 chip and Ion 
PGM Sequencing 400 Kit (Life Technologies) on the lon PGM sequencer (Life 
Technologies). Raw sequences were demultiplexed and quality-trimmed by the 
following procedures: (1) trimming bases with quality below Q15 from 3’ end 
of each read, (2) removing reads with average quality below Q20, (3) removing 
reads without primer sequences on both ends, and (4) removing reads with length 
shorter than 260 basepairs, using the FASTX-Toolkit (http://hannonlab.cshl.edu/ 
fastx_toolkit/index.html) and BBtrim (http://bbmap.sourceforge.net/). The pro- 
cessed sequences were then clustered into operational taxonomic units (OTU) 
defined at 94% similarity cutoff using UCLUST version 1.2.22q. Representative 
sequences for each OTU were classified taxonomically using RDP Classifier version 
2.2 with the SILVA 111 database. 

Quantitative PCR amplification of 16S rRNA genes. Quantitative 
PCR was performed in ABI7300 using GoTaq qPCR Master Mix 
(Promega). The following primer sets were used: ‘all bacteria, 5’-CGG 
TGAATACGTTCCCGG-3’ and 5’-TACGGCTACCTTGTTACGACTT-3’; 
Bacteroides, 5'-GAGAGGAAGGTCCCCCAC-3’ and 5’-CGCTACT 
TGGCTGGTTCAG-3’; Prevotella, 5‘- CACRGTAAACGATGGATGCC-3’ and 
5’-GGTCGGGTTGCAGACC-3’; Lactobacillus, 5'-TGGAAACA 
GRTGCTAATACCG-3’ and 5’'-GTCCATTGTGGAAGATTCCC-3’; 
Bifidobacterium, 5'-CTCCTGGAAACGGGTGG-3’ and 5‘-GGTGTTCTT 
CCCGATATCTACA-3’; Escherichia/Shigella, 5'-GAGTAAAGTTAAT 
ACCTTTGCTCATTG-3’ and 5’-GAGACTCAAGCTKRCCAGTATCAG-3’; 
Helicobacter, 5'-CTATGACGGGTATCCGGCC-3’ and 5’-TCGCCTTCG 
CAATGAGTATT-3’; Staphylococcus, 5'-TTTGGGCTACACACGTGCTAC 
AATGGACAA-3’ and 5’‘-AACAACTTTATGGGATTTGCWTGA-3’, 
Enterococcus, 5'-ATCAGAGGGGGATAACACTT-3’ and 5/- ACTCTCATCCT 
TGTTCTTCTC-3’; and Proteus, 5‘-GTTATTCGTGATGGTATGGG-3’ and 
5'/-ATAAAGGTGGTTACGCCAGA-3’. In the analysis of Lypd8-bound and 
unbound bacteria, all samples were normalized to the 16S rRNA gene level of 
‘all bacteria. 

Culture of colonic tissue homogenates. Colonic tissues were thoroughly washed 
twice and collected in tubes containing 5 ml PBS. After colonic tissues were homog- 
enized in PBS using a homogenizer, 10011 of the homogenates were incubated on 
lysogeny broth (LB) or MacConkey agar plates at 37°C in Ruskinn Bugbox Plus 
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anaerobic chamber (The Baker Company) for 24 h. The number of swarming col- 
onies on LB agar plates or white colonies on MacConkey agar plates was counted 
at 16 h after the start of incubation and colony-forming units were calculated. 
Inoculation of CFSE-labelled P. mirabilis. P. mirabilis was cultured in LB medium 
at 37°C until OD¢o0 reached 0.6. The bacterial culture was centrifuged at 8,000g for 
5 min and the pellet was resuspended in 1 ml PBS. The bacterial suspension was 
re-centrifuged and the pellet was labelled with CFSE Fluorescent Cell Labelling 
Kit (Abcam). After confirming CFSE labelling of P. mirabilis using a FACSCanto 
II (BD Biosciences), CFSE-labelled P. mirabilis was suspended with PBS (5 x 10° 
per ml) and the bacterial suspension (20011) was inoculated into anaesthetized 
mice via the transanal route with a catheter. At 1, 2, 4, and 8 h after inoculation, 
colons without washing were fixed in 4% PFA. Colon sections were counterstained 
with DAPI (Vector Laboratories), and analysed using a confocal microscope 
(FV1000-D; Olympus). 

Induction of DSS-induced colitis. Eight- to ten-week-old Lypd8~'~ mice and their 
littermate wild-type mice were used for DSS-induced colitis experiments. Acute 
colitis was induced by administration of 2% DSS (36-50 kDa; MP Biomedicals) in 
the drinking water for 5 days. Mice were analysed for changes in weight, survival 
rates and histological changes. 

Antibiotic treatment. Mice were treated with gentamicin (2 g] !; Nacalai Tesque) 
or vancomycin (500 mg1~!; Duchefa Biochemie B.V.) dissolved in autoclaved 
drinking water and provided for 2 weeks. Fluid intake was monitored. 

Assay of Lypd8 binding to faecal bacteria. Faeces were collected in tubes con- 
taining PBS. After weights were measured, PBS was added to make tenfold diluted 
suspensions. PBS-diluted faeces were mixed well and centrifuged at 400g for 
5 min to remove larger particles from bacteria. Supernatants (20011) were centri- 
fuged at 8,000g for 10 min to remove non-bound immunoglobulins. Pellets were 
resuspended in 1 ml PBS containing 2% FBS, and used as a bacterial suspension. 
Bacterial suspensions from wild-type mice and Lypd8/~ mice were incubated with 
anti-Lypd8 antibody. After washing with PBS containing 2% FBS, bacterial pellets 
were stained with Alexa Fluor 647 goat anti-mouse IgG1 (Invitrogen). Bacterial 
suspensions from C57BL/6] mice were incubated with recombinant Lypd8 protein 
(11g) on ice for 30 min and washed with PBS. Next, bacterial pellets were treated 
with anti-Flag M2 mAb (Sigma-Aldrich). Finally, bacterial pellets were stained with 
Alexa Fluor 488 goat anti-mouse IgG (Invitrogen). Bacterial suspensions were ana- 
lysed using a FACSCanto II (BD Biosciences) with FlowJo software (Tree Star) or a 
confocal microscope (FV1000-D; Olympus). Lypd8-bound and unbound bacteria 
were sorted using a FACSAria (BD Biosciences). 

Assay of Lypd8 binding to pure preparations of bacteria. Bacterial strains, 
Bacteroides sartrii JCM 17136!, Lactobacillus acidophilus JCM 11321, 
Bifidobacterium breve JCM 1192" and Enterococcus gallinarum JCM 8728", were 
obtained from Japan Collection of Microorganisms. P. mirabilis was isolated from 
the colonic tissue of Lypd8~/~ mice. All bacteria were cultured in Gifu anaerobic 
medium (Nissui) under anaerobic conditions. The pure preparation of each bac- 
teria was centrifuged at 8,000g for 5 min and the pellet was resuspended in 1 ml 
PBS containing 2% FBS. Bacteria were incubated with recombinant Lypd8 protein 
(11g) on ice for 1 h and washed with PBS. Next, the bacterial pellet was treated with 
anti-Flag M2 mAb (Sigma-Aldrich). Finally, the bacterial pellet was stained with 
Alexa Fluor 488 goat anti-mouse IgG (Invitrogen) and analysed using a FACSCanto 
II (BD Biosciences). 

Scanning electron microscopy with immunolabelling. Bacterial suspension 
was mixed with FLAG-tagged recombinant Lypd8 (rLypd8; 20ng,'), and then 
incubated for 1 h at 4°C. After the incubation, bacterial suspensions were cen- 
trifuged, and pellet was resuspended in PBS. A drop of bacterial suspension was 
placed on an 3-aminopropyltriethoxysilane (APS)-coated cover slip (Matsunami 
Glass IND), and blocked with bovine serum albumin and normal goat serum. The 
mounted bacteria were reacted with anti-FLAG M2 mAb (Sigma-Aldrich), washed 
with PBS, and reacted with 5-nm-gold-labelled anti-mouse IgG goat antibody 
(EY Laboratories). The sample was fixed with 2% glutaraldehyde and 1% OsOu, 
dehydrated in a graded ethanol series and 3-methylbutyl acetate, and dried in a 
critical-point drying chamber (HCP-1, Hitachi High-Technologies). It was then 
coated with a platinum layer approximately 1 nm thick in an ion sputter coater 
(E-1030, Hitachi High-Technologies), and examined under a scanning electron 
microscope (S-5000, Hitachi High-Technologies). 

Assay of Lypd8 binding to flagella. Bacterial suspension of P. mirabilis or E. coli 
JCM 1649" (Japan Collection of Microorganisms) in PBS was shaken 300 times 
per min for 60 min to remove flagella from bacterial bodies. Bacterial bodies were 
pelleted by centrifuging at 4,000g for 20 min. The supernatants were ultracentri- 
fuged at 80,000g for 60 min to obtain flagella. Bacterial bodies or flagella were mixed 
with the solution of FLAG-tagged recombinant Lypd8 (10 ngyl”!) or LOC69864 
(10 ngjl~'), and incubated for 3 h at 4°C. After the incubation, the bacterial bod- 
ies or flagella were pelleted by centrifugation or ultracentrifugation, respectively. 
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Then, the supernatant was collected and the pellet was resuspended in PBS. The 
supernatant and pellet suspension were separated with SDS-PAGE and transferred 
to polyvinylidene fluoride membranes (Millipore) that were incubated with anti- 
Flag M2 mAb (Sigma-Aldrich) and then HRP-conjugated goat anti-mouse IgG 
(cat NA931: GE Healthcare). Immunoreactivity was detected using SuperSignal 
(Thermo Scientific). 

Purification of recombinant flagellin protein of P. mirabilis. Bacterial 
DNA of P. mirabilis was extracted from the culture of P. mirabilis using 
a previously described protocol (see above). Using the extracted bacte- 
rial DNA as template, PCR was performed to isolate the flagellin gene with 
a primer set (5'-GGATCCGCACAAGTTATTAATACTAATTAT-3’ and 
5'-GCGGCCGCT TAACGTAACAGAGACAGAACAGT-3’) targeting the full- 
length flagellin gene of P. mirabilis. The isolated DNA was cloned in frame into 
EcoRI- and NotI-digested pGEX-6P-2 (GE Healthcare) to generate a glutathione 
S-transferase (GST)-flagellin fusion construct. A recombinant flagellin protein was 
prepared by growing E. coli harbouring the pGEX-flagellin plasmids in LB medium 
containing 100,gml of ampicillin at 37 °C until OD 09 reached 0.6. After growth, 
1mM isopropyl 8-p-1-thiogalactopyranoside (IPTG) was added to the medium, 
and bacterial culture was continued for 4 h at 37°C. Cells were centrifuged and 
the cell pellet was suspended in PBS with 1% Triton X-100. The cell suspension 
was sonicated 5 times for 30 s at 4°C, and cleared by centrifugation. The resulting 
solution was bound to Glutathione-Sepharose 4 Fast Flow (GE Healthcare). The 
beads were washed 3 times. The GST fusion proteins were then eluted with elution 
buffer (5mM glutathione, 50 mM Tris-HCl, pH 9.6). Elution buffer was replaced 
with PBS with Amicon Ultra-15 (Millipore). The amounts and purity of the protein 
were estimated by SDS-PAGE. 

ELISA binding assay. 96-well plates (Corning) were coated with 5\1g ml! GST 
peptides, 5.gml~! GST-tagged flagellin protein of P. mirabilis, 100 1g ml! flagella 
of P. mirabilis or PBS 1% BSA for 16h at 4°C. Plates were then washed, and increas- 
ing concentrations of mouse TLRS Fc protein (R&D systems) or FLAG-tagged 
Lypd8 protein diluted in PBS 1% BSA were added and incubated for 2h at room 
temperature. Plates were then washed, and 0.5,1g ml“! of HRP-conjugated goat 
anti-mouse IgG (cat NA931: GE Healthcare) or HRP-conjugated anti-FLAG M2 
mAb (Sigma-Aldrich) diluted in PBS 1% BSA was added and incubated for 2h at 
room temperature. Plates were then washed, and TMB substrate was added. Plates 
were then read at 450 nm with a spectrometer. 

In vitro growth assay of P. mirabilis. P. mirabilis (1 x 10°) were incubated in LB 
medium (1 ml) with the indicated concentrations of recombinant Lypd8 protein 
for 6h at 37°C. The bacterial cultures were applied to MacConkey agar plates and 
incubated at 37°C for 16 h, and the number of colonies was counted and colo- 
ny-forming units were calculated. 

Analysis of LYPD8 expression in Caco-2 cells and human colon epithelia. 
Human colon epithelia were obtained from the normal mucosa of resected colon 
tissues from colon cancer patients. Colonic epithelial layers were first stripped by 
shaking sections of colon in 5mM EDTA/HBSS. Total RNA was isolated from 
Caco-2 cells and human colon epithelia and reverse transcribed. CDNA was ana- 
lysed by real-time RT-PCR. Values were normalized to the expression of GAPDH, 
and the fold difference in expression relative to that of GAPDH is shown. 
The following primer sets were used: LYPD8, 5‘-GAACACTTTCATTT 
TGTAAGC-3’ and 5’-ACGACAGGAAGTTCCATTAGA-3’; GAPDH, 
5/-TGGATATTGTTGCCATCAATG-3’ and 5’-TGATGGGATTTCCATTGA 


TGA-3’. The experiment was approved by the Ethical Committee of Osaka 
University School of Medicine and informed consent for specimen use was 
obtained from all patients. 

Bacterial adhesion assay on Caco-2 cells. E. gallinarum or P. mirabilis (1 x 10°) 
was added to a culture of polarized Caco-2 cells. For PNGase F treatment, cells were 
irradiated with 30 Gy to stop cell growth and then incubated in culture medium 
containing PNGaseF (Sigma-Aldrich) (2 unit/ml) for 2h. At 8h after the addition 
of P. mirabilis, each well was washed with PBS three times and then Caco-2 cells 
were removed from transwell filters with trypsin/EDTA solutions and suspended 
in PBS. PBS containing Caco-2 cells were applied to LB or MacConkey agar plates 
and incubated at 37°C for 16h, and the number of colonies was counted and 
colony-forming units were calculated. 

Swarming motility assay of P. mirabilis. P. mirabilis was cultured in LB medium 
at 37°C until OD¢o0 was 0.6. LB agar plates (1.5%) were centrally inoculated with 
5 wl of bacterial culture and incubated at 37°C. After 1h, polarized Caco-2 cells 
with or without Flag-tagged Lypd8 expression, cultured on 75 mm diameter tran- 
swell filters of 3.0j1m pore size (BD Biosciences) for approximately 2 weeks, were 
added to P. mirabilis-inoculated agar plate so that the apical side of cells adhered to 
the surface of the agar plate. Motility was assessed by examining circular swarms 
formed by growing motile bacterial cells. The length from the centre to four points 
of the edge of circular swarms was measured at 5, 9 and 13h time points following 
P. mirabilis inoculation. 

Motility assay of P. mirabilis and E. coli in semisolid agar. CMT-93 cells (5 x 107 
cells) with or without Flag-tagged Lypd8 were lysed with 1 ml of CelLytic M Cell 
Lysis Reagent (Sigma-Aldrich). After removal of any insoluble material by cen- 
trifugation and filtration with a 0.45 1m filter, the cell lysates were incubated 
with 501] of anti-Flag M2 affinity gel (Sigma-Aldrich) for 3h. The resin was 
centrifuged and washed with Tris-buffered saline three times. The resin containing 
Flag-tagged Lypd8 was mixed with 1 ml of LB medium containing 0.3% agar. 
The final concentration of Lypd8 in LB agar was estimated by SDS-PAGE and 
was approximately 1.25,j.gml~!. P mirabilis and E. coli were cultured in LB medium 
at 37°C until OD¢09 was 0.6. Semisolid LB agar (0.3%) containing Flag-tagged 
Lypd8 were centrally inoculated with 0.5 11 of bacterial culture and incubated at 
37°C. Motility was assessed by examining circular migration. The radius of circles 
formed by bacterial migration was measured at 4h after the bacterial inoculation. 
Statistical analysis. Data are presented as mean + s.d., as indicated in the figure 
legends. Differences between control and experimental groups were evaluated 
using a two-tailed unpaired Student's t-test. For the swarming motility assay, a 
paired Student's t-test was performed. A P value of <0.05 was considered signifi- 
cant. For the determination of microbiota by deep sequencing, principal compo- 
nent analysis was used to visualize data sets by statistical programming language 
R 2.1.5. No statistical methods were used to predetermine sample size. No animal 
or sample was excluded from the analysis. 
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Extended Data Figure 1 | Lypd8 is a GPI-anchored glycoprotein. 

a, b, Quantitative RT-PCR analysis of Lypd8 expression in the colonic 
epithelia of ICR mice in specific-pathogen-free and germ-free conditions 
(a), and in colonic epithelia of wild-type, Myd88~/~ and Asc~'~ mice (b). 
Values were normalized to Gapdh. Data represent the mean + s.d. (n=3 
per group). NS, not significant. c, Confocal microscopic image of Caco-2 
cells expressing Flag-tagged Lypd8 cultured on transwell membrane, 


stained with anti-Claudin-1 antibody (red), anti-Flag antibody (green), 
and DAPI (blue). d, 293T cell lysates transiently expressing Flag-tagged 
Lypd8 and Flag-tagged mutant Lypd8 (N-D), in which thirteen Asp 
residues were converted to Asn, were separated with SDS-PAGE and 
immunoblotted with anti-Flag antibody. e, Flow cytometric analysis of 
CMT93 cells expressing Flag-tagged Lypd8 before (left) and after (right) 
treatment with PI-PLC. 
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Extended Data Figure 2 | Lypd8 is expressed in enterocytes. a, Structures 
of the Lypd8 gene (middle), the targeting vector (top) and the predicted 
Cre-IRES- Venus inserted gene (bottom). Grey box, coding exon; black 
box, non-coding exon. b, Flow cytometry of intestinal epithelial cells 
isolated from caecum and colon of Lypd8’*™s mice. Venus" epithelial 

cells (Lypd8*) are boxed. c, Colon sections of Lypd8’*™'S mice were 
counterstained with DAPI to visualize nuclei. Scale bar, 201m. Venus* 
Lypd8 was expressed at the uppermost layer of the colon epithelia. d, Flow 
cytometry of intestinal epithelial cells isolated from the colon of Lypd8’°""s 


mice stained for CD3. e, Lypd8-expressing colonic epithelial cells (black 
box in d, Lypd8*) and Lypd8 non-expressing colonic epithelial cells 

(red box in d, Lypd8~) are isolated and analysed for expression of 

Villin (highly expressed in enterocytes), Muc2 (selectively expressed in 
goblet cells) and Lgr5 (selectively expressed in intestinal stem cells) by 
quantitative RT-PCR. Lypd8* cells expressed Villin, but neither Muc2 nor 
Lgr5. Values were normalized to Gapdh expression. Data are mean + s.d. 
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Extended Data Figure 3 | Generation of Lypd8~/~ mice. a, Structures Ncol, separated by electrophoresis and hybridized with the radiolabelled 
of the Lypd8 gene (middle), the targeting vector (top) and the predicted probe shown in a. c, Quantitative RT-PCR analysis of Lypd8 expression in 
disrupted gene (bottom). Grey box, coding exon; black box, non-coding the colon of wild-type and Lypds"' ~ mice. The values were normalized to 
exon. b, Southern blot analysis of offspring from the heterozygote Gapdh expression. Data are mean + s.d. (n =3 per group). 


intercrosses. Genomic DNA was extracted from mouse tails, digested with 
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Extended Data Figure 4 | Bacteria are present in Lypd8~'~ epithelial 
cell layers. a, Principal component analysis of faecal microbiota in 
wild-type and Lypd8~'~ mice at genus level (n =8 mice per genotype). 
b, Bacterial DNA isolated from luminal contents and PBS-washed colon 
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tissues of wild-type and Lypd8~/~ mice was analysed by quantitative PCR 


using universal primers for bacterial 16S rRNA genes. Data show the 
total bacterial DNA amounts compared to wild-type mice group, and are 
presented as mean + s.d. (n= 6 per group). **P<0.01. 
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Extended Data Figure 5 | Inflammatory responses were not evident in Lypd8~'~ mice reared in an SPF facility. Expression of inflammatory cytokines 
and chemokines in the colon of wild-type and Lypd8~'~ mice. Data represent the mean + s.d. (n= 10 per group). 
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Extended Data Figure 6 | Gentamicin treatment ameliorated DSS- administration (d, e). Scale bars, 50,1m. Data are shown as mean + s.d. 
induced colitis in Lypd8 mice. a, A paper disc method was performed to *P < 0.05, ***P < 0.005. f, Lypd8~'~ mice (n= 10) and gentamicin- 
determine the sensitivity of gentamicin and vancomycin to P. mirabilis. treated Lypd8~/~ mice (n = 10) were orally administered 2% DSS for 
b-e, Wild-type (n = 8) and Lypd8~'~ mice (n= 8) were orally administered _ 5 days and their survival rate was monitored. g, Lypd8~'~ mice (n = 10) 
gentamicin (b, d) or vancomycin (c, e) for 2 weeks and then administered and vancomycin-treated Lypd8~'~ mice (n= 10) were orally administered 
DSS for 5 days. Percentage of weight change (b, c). H&E staining of 2% DSS for 5 days and their survival rate was monitored. Data show 
colon tissues at day 8 (gentamicin) and day 5 (vancomycin) of DSS survival rate of each group. 
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P. mirabilis 
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Extended Data Figure 7 | Lypd8 preferentially bound to in-vitro- 
cultured P. mirabilis. a, Representative dot plots of pure preparations 
of in-vitro-cultured B. sartrii, L. acidophilus, B. breve, E. gallinarum 
and P. mirabilis incubated with Flag-tagged Lypd8. Numbers within 
plots indicate percentages of bacteria bound by anti-Flag antibody in 
the respective areas. b, P. mirabilis was cultured in LB medium with the 
indicated concentrations of recombinant (r)Lypd8 protein for 6 h at 


@.0@V x80.0kK  3¢a5nm 


37°C. The mean CFU of cultured P. mirabilis are shown (n =4 per group). 
t s.d. ¢, In-vitro-cultured P. mirabilis were incubated with 
or without Flag-tagged Lypd8 and reacted with indicated 1st and 2nd 
antibodies. Scanning electron microscopic images of P. mirabilis labelled 
for immunogold detection of the bound Lypd8 were shown. White arrows 
indicate gold-particles. Scale bars, 300 nm. 
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Extended Data Figure 8 | Isolation of flagella of P. mirabilis and 
E.coli. a, b, Bacterial flagella were isolated from bacterial suspension 

of P. mirabilis and E.coli by vigorous shaking and ultracentrifugation. 
Transmission electron microscopic images of flagella fraction and 
bacterial bodies fraction from P. mirabilis (a) and E.coli (b) are shown. 
Flagella and bacterial bodies were visualized by negative staining. Scale 
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bars, 2.5 41m. c, After incubation of Flag-tagged LOC69864 protein 
(10ng,l~') with flagella of P. mirabilis (1 xg l~'), flagella were pelleted 
by ultracentrifugation. The supernatant (sup) and pellet suspension were 
separated by SDS-PAGE and immunoblotted with anti-Flag antibody. 

d, e, Association of mouse TLR5-Fc chimaera protein and flagellin (d) or 
flagella (e) of P. mirabilis was analysed by ELISA assay. 
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Extended Data Figure 9 | LYPD8 expression in Caco-2 cells and human colon epithelia. a, Quantitative RT-PCR analysis of LYPD8 expression in 
Caco-2 cells and human colonic epithelia isolated from normal parts of colonic tissues from colon cancer patients (n = 3). b, Immunoblot analysis for 
detecting Flag-tagged proteins in the supernatants on Caco-2 cells with or without Flag-tagged Lypd8. 
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Extended Data Figure 10 | Motility of E. coli was inhibited in semisolid agar containing Lypd8 protein. a, Representative images of E. coli motility in 
semisolid agar with or without Lypd8 protein 4h after incubation. b, The radii of the motility halos by E. coli were measured at 4h after incubation. Data 
are mean + s.d. (n =6 per group). ****P < 0.001. 
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Cerebral cavernous malformations arise from 
endothelial gain of MEKK3-KLF2/4 signalling 


Zinan Zhou', Alan T. Tang!*, Weng-Yew Wong”, Sharika Bamezai!, Lauren M. Goddard!, Robert Shenkar’, Su Zhou!, 
Jisheng Yang', Alexander C. Wright*, Matthew Foley’, J. Simon C. Arthur®, Kevin J. Whitehead’, Issam A. Awad, Dean Y. Li”, 


Xiangjian Zheng”? & Mark L. Kahn! 


Cerebral cavernous malformations (CCMs) are common inherited 
and sporadic vascular malformations that cause strokes and seizures 
in younger individuals’. CCMs arise from endothelial cell loss of 
KRIT1, CCM2 or PDCD10, non-homologous proteins that form an 
adaptor complex”. How disruption of the CCM complex results in 
disease remains controversial, with numerous signalling pathways 
(including Rho**, SMAD*° and Wnt/-catenin®) and processes such 
as endothelial-mesenchymal transition (EndMT)° proposed to 
have causal roles. CCM2 binds to MEKK3 (refs 7-11), and we have 
recently shown that CCM complex regulation of MEKK3 is essential 
during vertebrate heart development!”. Here we investigate this 
mechanism in CCM disease pathogenesis. Using a neonatal mouse 
model of CCM disease, we show that expression of the MEKK3 
target genes KIf2 and Ki/f4, as well as Rho and ADAMTS protease 
activity, are increased in the endothelial cells of early CCM lesions. 
By contrast, we find no evidence of EndMT or increased SMAD or 
Wnt signalling during early CCM formation. Endothelial-specific 
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Figure 1 | KLF2, KLF4 and ADAMTS4 are increased in the earliest 
CCM lesions. a, b, Early CCM lesions (arrow) appear as haemorrhagic, 
dilated venules in the white matter (WM) of the cerebellum. GL, granular 
layer. Scale bars, 1 mm (white) and 50j1m (black). c, Mature CCM lesions 
are detected as blood-filled caverns in the hindbrain of P11 Krit1®°X° 
(iECre;Kritl") animals. Scale bars, 1 mm (white) and 500 1m (black). 

d, Quantitative PCR (qPCR) analysis reveals increased K/f2, KIf4 and 
Adamts4 expression in hindbrain endothelial cells from Krit1®°*° mice. 
**P < 0.01; ***P < 0.001 (unpaired two-tailed Student’s t-test). Error bars 
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loss of Map3k3 (also known as Mekk3), KIf2 or KIf4 markedly 
prevents lesion formation, reverses the increase in Rho activity, and 
rescues lethality. Consistent with these findings in mice, we show 
that endothelial expression of KLF2 and KLF4 is increased in human 
familial and sporadic CCM lesions, and that a disease-causing 
human CCM2 mutation abrogates the MEKK3 interaction without 
affecting CCM complex formation. These studies identify gain of 
MEKKS3 signalling and KLF2/4 function as causal mechanisms 
for CCM pathogenesis that may be targeted to develop new CCM 
therapeutics. 

To understand the cellular and molecular mechanisms that under- 
lie CCM formation, we first examined the temporal course of lesion 
formation in mice with induced, endothelial-specific deletion of Krit1 
immediately after birth (iECre;Kritl, hereafter termed Krit]®“*° mice). 
Vascular malformations were first detected at postnatal day 6 (P6) as dilated 
venules in the cerebellar white matter of Krit1®“*° mice, with numerous 
mature lesions present by P11 (ref. 13) (Fig la—c and Extended Data Fig. 1). 
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indicate s.e.m. (n= 4). e, Abluminal ADAMTS-cleaved versican (DPEAAE) 
and nuclear KLF4 are detected in nascent (P6) and mature (P11) CCM lesions 
(arrows). Scale bars, 50m. f, Increased levels of ADAMTS4 detected by 
immunoblotting of P11 Krit1®°S° hindbrain lysate. Results are representative 
of 3 separate experiments. GAPDH immunoblot shown for loading control. 

g, Increased KI/f2 expression in CCM lesions (arrows) and meningeal vessels 
of Krit1®°®° animals detected using in situ hybridization. Areas in boxes 

(left) are shown at higher magnification (right). Scale bars, 500 1m (left) and 
100m (right). Dotted lines (a, b, e) outline cerebellar white matter. 


Krit1 


iECre;Krit1#/ 


1Department of Medicine and Cardiovascular Institute, University of Pennsylvania, 3400 Civic Center Blvd, Philadelphia, Pennsylvania 19104, USA. @Laboratory of Cardiovascular Signaling, 
Centenary Institute, Sydney, New South Wales 2050, Australia. 7Neurovascular Surgery Program, Section of Neurosurgery, Department of Surgery, The University of Chicago Medicine and 


Biological Sciences, Chicago, 


llinois 60637, USA. “Department of Radiology, University of Pennsylvania Medical Center, 3400 Spruce Street, Philadelphia, Pennsylvania 19104, USA. °Sydney 


Microscopy & Microanalysis, University of Sydney, Sydney, New South Wales 2050, Australia. ®Division of Cell Signaling and Immunology, University of Dundee, Dundee DD1 5EH, UK. Division of 


Cardiovascular Medicine and 


he Program in Molecular Medicine, University of Utah, Salt Lake City, Utah 84112, USA. 8The Key Laboratory for Human Disease Gene Study of Sichuan Province, 


Institute of Laboratory Medicine, Sichuan Academy of Medical Sciences & Sichuan Provincial People’s Hospital, Chengdu, Sichuan 610072, China. °Faculty of Medicine, Sydney Medical School, 


University of Sydney, Sydney, New South Wales 2050, Australia. 
*These authors contributed equally to this work. 


122 | NATURE | VOL 532 | 7 APRIL 2016 


© 2016 Macmillan Publishers Limited. All rights reserved 


Krit 1/4 iECre;Krit1™/ ;Map3k3 "+ 


-—. = 


LETTER 


b iECre;Krit1" iECre;Krit1/ ;Map3k3t/+ 


P6 
0.8 0.5 THREE Fem kK CO Kit 
© 06 5 20 Mm M M te iECre;krit1™”" 
£ g mm iECre;Krit?”"-Map3ka* 
@ 0.4 515 
€ 3 
3 02 2 1 
> o 
0.0 Oe-eeoo gz o8 
0 
« wa a fl/fl 
H iECre;krit1™ Lah \ bk i) 2 
© i€Cre;krit™;Map3aks”* = s s s r RS . 
ppp ps 
e f 
100) ™“ KLF2 xxx  KLFA 
S naa g 60 ie g 40 
= B48 % 3.0 7: 
B 50 — iECre;Krit1™”" a” a 
= + iECre;Krit1";Map3k3"* © 99 o 
Q 2, g 
D a ag, LO 
oO oO 
0 aco a ie} T 
0 20 40 60 
(Scr. mm@siKRIT1 mm siKRIT1 +HF 
Daysaites bial [ siKRIT1+tempol Ea siKRIT1 + VitD3 
g Krit 1/1 iECre;Krit1 ‘4 iECre;Krit1/-Map3k3™/*+ 
o) 
peat | 
= 
<<. 
= 
Ke 
Oo 
Ww 
a 


Figure 2 | Genetic rescue of CCM formation and increased Rho activity 
with loss of MEKK3. a, Visual detection of CCM lesions in the hindbrains 
of P11 Kritl“" (top left), Krit1®°®° (iECre;Kritl!"; bottom left), and 
Map3k34e8SQ (iECre;Krit !"; Map3k3” *; right) mice (right). Scale bars, 
1mm. b, Composite microCT images of Krit1®°®° and Map3k3'e®SQ 
hindbrains. CCM lesions are shown in red. c, Quantification of CCM 
lesion volumes (n= 5 for each group). d, qPCR analysis of gene expression 
in cerebellar endothelial cells (n =4). e, Postnatal survival of Krit1®CX° 
animals with and without endothelial loss of one Map3k3 allele. P=0.0009. 


We recently demonstrated that loss of KRIT1, CCM2 or PDCD10 in 
endothelial cells of the developing mouse heart upregulates expression 
of KLF2, KLF4, ADAMTS4 and ADAMTSS owing to increased activ- 
ity of the MEKK3 signalling pathway'”. Analysis of isolated cerebellar 
endothelial cells from neonatal Krit1*“*° and littermate control mice at 
P6 revealed increased Adamts4, but not Adamts5, messenger RNA and 
protein, in addition to increased levels of both K/f2 and K/f4 (Fig. 1d, f). 
ADAMTS4 cleaves the proteoglycan versican to expose a neo-epitope 
(DPEAAE) that was detected immediately adjacent to the endothelial 
cells of both early and late CCM lesions (Fig. 1e). Increased levels of 
nuclear KLF4 protein and K/f2 mRNA were also detected in the endothe- 
lial cells of CCM lesions and other vessels in the cerebellum (Fig. le, g). 
These findings reveal increased levels of KLF2, KLF4 and ADAMTS4 
during the earliest phase of CCM lesion formation in vivo. 


f, qPCR analysis of gene expression in human umbilical vein endothelial 
cells (HUVECs) treated with scrambled (Scr.) or KRIT1-targeting (sikRIT1) 
short interfering RNAs (siRNAs), alone and in the presence of the indicated 
Rho antagonists (n = 3). HE, hydroxyfasudil. g, Normalization of Rho 
activity with loss of MEKK3. PECAM and pMLC staining of white matter 
vessels in the indicated P6 littermate brains. Images in all panels are 
representative of 5 independent studies for each genotype. Scale bars, 50 jum. 
*P< 0.05; **P< 0.01; ***P < 0.001; ****P < 0.0001 (unpaired two-tailed 
Student's t-test). Error bars indicate s.e.m. 


Recent studies have implicated numerous signalling mechanisms 
as causal for CCM formation, including the Rho**"4, TGF-8/BMP®, 
Wnt/3-catenin® and Notch'® pathways. To determine whether changes 
in Rho activity are an early event in CCM pathogenesis, we examined 
the ROCK substrate phosphorylated-myosin light chain (pMLC). 
pMLLC levels were markedly increased in the brain capillary and 
venous endothelial cells of P6 neonatal Krit1®-S° mice compared 
with littermate controls, including those lining the earliest detectable 
CCM lesions (Extended Data Fig. 2a). TGF-6/BMP signalling through 
SMAD1 and SMAD3 and a mechanism of EndMT has been proposed 
to explain gene expression changes in KRIT1-deficient brain endothe- 
lial cells, including increased KLF4 (refs 5, 16). A recent study by the 
same group suggested that increased (-catenin signalling is a primary 
event that culminates in later EndMT°. To assess endothelial 8-catenin 


7 APRIL 2016 | VOL 532 | NATURE | 123 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


b iECre;Krit1/ 


iECre;Krit1"/" iECre;Krit1™ ;Kif2t/+ — iECre;Krit1"/";Kif2t/ iECre;Krit1"/" ;Kif4f/+ 


iECreskrit1™” ieCrekritq 
tek o +e BB iCre;krit7 


08 > ieCreykrit1"” Kuro" 0.8 > iECre;Krit1™ Kira” 
ee : O iéCresKritt™" Kio . 
= 06 
E —€ a 
= E041 —— 
i= . 


Krit 1/41 iECre;Krit1/ 


Figure 3 | Genetic rescue of CCM formation and increased Rho activity 
with loss of KLF2 and KLF4. a, b, Visual appearance of CCM lesions in 
Krit1®°S° (iECre;Kritl™"), KIf28°®S2 (iECresKrit"KIf2/*), KIfghomoRsQ 
(iECre;Krit "KIf2"") and KIf4F "882 (jECre;Krit ";KIf4"*) mice (top), 
and composite microCT images of the same hindbrains (bottom). Scale 
bars, 1mm. c, d, Volumetric quantification of CCM lesions in Krit1 ese) 
littermates, with and without endothelial loss of one or two K/f2 alleles 

(c) or one Kif4 allele (d) (n=5-7). **P < 0.01; ***P < 0.001 (unpaired 
two-tailed Student’s t-test). e, Rescue of increased Rho activity by loss of 
KLF2. PECAM and pMLC staining of white matter vessels in the indicated 
P6 littermate brains is shown. Images in all panels are representative of at 
least 4 independent studies for each genotype. Scale bars, 501m. 
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signalling during CCM lesion formation, we generated neonatal 
Krit1®°X° mice on a TCF/Lef:H2B-GFP Wnt/8-catenin reporter 
background'’. Immunostaining for green fluorescent protein (GFP) 
revealed (-catenin signalling in white matter vascular endothelium that 
was not increased in P6 CCM lesions (Extended Data Fig. 2b), while the 
levels of both GFP protein and the 8-catenin target genes Axin2 and Lef1 
were unchanged in cerebellar endothelial cells isolated from P6 or P11 
neonatal Krit1®°®° mice (Extended Data Fig. 2c, d). Immunostaining 
of brain sections and immunoblotting of cerebellar endothelial 
cell protein also revealed no changes in pSMAD3 (Extended Data 
Fig. 2e, f). Finally, analysis of cerebellar endothelial gene expression 
analysis revealed no change in the expression of Notch target genes at P6, 
although an increase in Hes1 was noted at P11 (Extended Data Fig. 3). 
These studies reveal that primary CCM lesion formation is associ- 
ated with increases in K/f2, Klf4 and Adamts4 expression and Rho/ 
ROCK activity, but not in TGF-8/BMP, Wnt/8-catenin or Notch 
signalling. 

The above findings suggested that changes in KLF2/4 and 
ADAMTS4 expression may be causal for CCM formation. The CCM 
complex directly binds MEKK3 (refs 7-11), a MAP3 kinase known 
to regulate KLF2 and KLF4 expression in the embryonic heart in 
cultured endothelial cells!*, and we previously found that Map3k3 
haploinsufficiency rescues the loss of cardiac jelly associated with 
endocardial loss of CCM signalling”. Map3k3 haploinsufficiency was 
also found to rescue the early embryonic lethality conferred by pan- 
endothelial loss of KRIT1 (Extended Data Fig. 4a and ref. 18), suggest- 
ing that excess endothelial MEKK3 signalling may have a broad role in 
the cardiovascular phenotypes associated with loss of CCM signalling. 

To determine whether this model underlies CCM formation, we 
generated iECre;Krit l“"; Map3k3!* mice (termed Map3k3#*'®SQ 
mice). Visual inspection of the hindbrains of P11 Map3k3#*®82 mice 
compared with neonatal Krit1™C*° littermate controls revealed a 
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notable reduction in the number and size of vascular lesions (Fig. 2a). 
To quantify CCM formation, we imaged P11 hindbrains using 
contrast-enhanced, high resolution X-ray micro-computed tomography 
(microCT), and measured actual lesion volumes using semi-automated 
software. Map3k3"*®S2 mouse hindbrains exhibited nearly complete 
prevention of the lesion phenotype compared with neonatal Krit]®C®° 
littermates, as assessed by hindbrain microCT imaging (Fig. 2b), 
blinded measurement of total CCM lesion volume (Fig. 2c), and 
measurements of K/f2, Kif4 and Adamts4 in P6 cerebellar endothelial 
cells (Fig. 2d). While almost all neonatal Krit1®K° mice were dead by 
P30, all Map3k3#t88Q animals remained alive (Fig. 2e), and exhibited 
normal growth and development. Finally, partial loss of MEKK3 also 
fully rescued CCM lesion formation in Com2"CK° (iECre;Com2™") 
mice (Extended Data Fig. 4b). These genetic findings support the con- 
clusion that CCM lesions arise from gain of MEKK3 signalling and 
altered downstream gene expression in the endothelium. 

Increased endothelial pMLC is coincident with increased K/f2, KIf4 
and Adamts4 expression in the earliest CCM lesions (Fig. le, g and 
Extended Data Fig. 2a), suggesting either that changes in Rhno/ROCK 
activity are downstream of changes in MEKK3 activity or vice versa. The 
Rho-inhibiting agents hydroxyfasudil, tempol and vitamin D3 (ref. 19) 
failed to reverse the increase in KLF2 and KLF4 expression conferred by 
loss of KRIT1 in cultured human endothelial cells (Fig. 2f), suggesting 
that Rho is not upstream of the KLF2/4 expression changes associated 
with loss of CCM function. By contrast, P6 Map3k3""8S mice exhib- 
ited a complete normalization of endothelial pMLC staining (Fig. 2g), 
indicating that increased Rho activity arises secondary to increased 
MEKK3 signalling during CCM formation. 

To test the roles of KLF2 and KLF4 in CCM pathogenesis, we meas- 
ured lesion formation in Kif2et8sQ (iECre;Krit !",KIp2Y 7), K]fphomorse 
(iECre;Krit ™",K1f2"") and KIfa8'®8Q (GECre;Krit "KIf4”*) mice 
compared with littermate neonatal Krit®“®° controls at P11. KIf2H*®S2 
and KIf4!*8SQ mice exhibited a marked but incomplete prevention of 
lesion formation (80% and 75% rescue for KIf24S and KIf4iet®SQ 
mice, respectively) based on visual inspection of cerebellar lesions 
and quantification of CCM lesion volume after microCT imaging 
(Fig. 3a-d). Remarkably, K/f2"°™°8S2 mice exhibited 99% rescue 
(Fig. 3a, c), with only a small amount of venule dilatation visible histo- 
logically (Extended Data Fig. 5). pMLC staining was normalized in P6 
KIf288SQ mice (Fig. 3e), indicating that increased Rho/ROCK activity 
arises secondary to increased KLF2 expression. These findings identify 
gain of KLF2 and KLF4 function as causal for CCM formation, and 
suggest that these transcription factors are the primary downstream 
targets of MEKK3 in this disease model. They also highlight the notable 
molecular conservation of this endothelial pathway: from zebrafish to 
mammals, and from embryonic vascular endothelium and endocar- 
dium to postnatal brain endothelium. 

To determine whether MEKK3-KLF2/4 signalling is increased in 
human CCMs, we examined resected lesions from two patients with 
familial CCM bearing KRIT1 and PDCD10 germline mutations, and 
two sporadic CCM patients lacking any previous genetic or molec- 
ular data. Markedly increased nuclear KLF2 and KLF4 expression 
was observed in the endothelial cells of both familial and sporadic 
human CCM lesions (Fig. 4a, b), consistent with increased MEKK3 
signalling and studies performed using the mouse model. MEKK3 
binds CCM2 through the carboxy-terminal helical harmonin domain 
of CCM2, and CCM2 truncation mutants lacking this domain do 
not bind MEKK3 (Extended Data Fig. 6a, b and refs 10, 11, 20, 21). 
A literature search identified a familial CCM patient with a four- 
nucleotide duplication in the last exon of CCM2 (CCCTdup) pre- 
dicted to delete most of the helical harmonin domain”? (Fig. 4c). 
CCM2 CCCTdup expressed normally in HEK293T cells and bound 
KRIT1 and PDCD10 in a manner indistinguishable from wild-type 
CCM2, but failed to interact with MEKK3 (Fig. 4c—e and Extended 
Data Fig. 6c). These results suggest that specific disruption of the 
CCM2-MEKK:3 interaction is sufficient to cause familial CCM 
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Figure 4 | Human CCMs exhibit high levels of endothelial KLF2 and 
KLF4 and arise owing to selective loss of CCM2-MEKK3 interaction. 
a, b, Immunostaining for PECAM plus either KLF2 (a) or KLF4 (b) is 
shown for cerebral vessels in individuals without CCM disease (control; 
left), for CCM lesions arising owing to germline mutations in KRIT1 

or PDCD10 (familial; middle), and for sporadic CCM lesions from two 
individuals (sporadic; right). Arrows indicate nuclear KLF2-high and 
KLF4-high endothelial cells in the CCM lesions. Red asterisks indicate 
fluorescence due to trapped intravascular erythrocytes. Scale bars, 50m. 


disease, and that human CCMs also arise due to loss of MEKK3 reg- 
ulation and increased expression of KLF2 and KLF4. 

How gain of MEKK3-KLF2/4 signalling confers CCM formation 
is unclear. It has been proposed that EndMT underlies CCM patho- 
genesis”, but we detect no evidence of a change in phenotype from 
endothelial to mesenchymal with loss of CCM signalling, and demon- 
strate that the loss of the non-mesenchymal transcription factor KLF2 
is sufficient to rescue CCM formation fully. Our studies identify two 
effector pathways downstream of MEKK3-KLF2/4 signalling that may 
drive early CCM pathogenesis: Rho signalling and ADAMTS proteo- 
lytic activity. Increased Rho activity has been linked to increased stress 
fibre formation, loosened junctions and decreased tube formation in 
cultured endothelial cells*'***, and loss of vascular integrity in mice*”®, 
but whether and how these changes might cause CCM formation is not 
yet clear. Increased ADAMTS activity is conferred by gain of KLF2 and 
KLF4 function (Extended Data Fig. 7), and may confer CCM formation 
through breakdown of a proteoglycan matrix that is required specifi- 
cally for the CNS vasculature (for example, versican; Extended Data Fig. 
1b). A proteolytic mechanism would also explain the autosomal dom- 
inant inheritance of this disease, as CCM-deficient endothelial cells 
generated by a rare second-hit mechanism**”’ could degrade the matrix 
supporting an entire vessel, thereby creating a cavernous malformation. 

Finally, these studies may help to identify new therapies for CCM 
disease. Since MEKK3 upregulates KLF2 and KLF4 through the 
MEKS and ERK5 downstream MAPKs!*”5, we evaluated the effects of 
available inhibitors on CCM formation. BIX02189 (anti- MEK5) and 
XMD17-109 (anti-ERK5) reversed the increase in KLF2 and KLF4 
expression associated with loss of CCM signalling in cultured endothe- 
lial cells (Extended Data Fig. 8), but failed to affect CCM formation 
at the low doses tolerated by neonatal Krit1 ECKO mice (not shown). 
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c, Schematic representation of the CCM2 CCCT duplication (CCCTdup). 
HH, helical harmonin domain; PTB, phosphotyrosine-binding domain. 

d, CCM2 CCCTdup does not bind MEKK3. The indicated proteins 

were expressed in HEK293T cells, immunoprecipitated (IP) using 
anti-haemagglutinin (HA) antibodies, and immunoblotted (IB) using 

the antibodies shown. e, CCM2 CCCTdup binds KRIT1 and PDCD10 

like wild-type CCM2. The indicated proteins were expressed and co- 
immunoprecipitated as described in c. Results shown are representative of 
at least 3 separate experiments. 


Therapies targeting Rho (for example, fasudil”® or other recently iden- 


tified agents’) or the ADAMTS proteases may prove more effective, 
but future studies that rigorously test the causal role of these putative 
downstream effectors remain essential for the rational development of 
CCM disease therapies. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Mice. VE-cadherin CreERT2 (‘iECre’) transgenic mice were a gift from R. Adams*®. 
Tie2, Kritl™", Coma, Map3k3, Kips and Kia animals have been pre- 
viously described!>3!-34, The TCF/LEF:H2B-GFP reporter line was obtained 
from Jackson Laboratories (013752)!”. All intercrossed animals were maintained 
on mixed-strain backgrounds. Breeding pairs between 2 and 8 months of age were 
used to generate the neonatal CCM mouse model pups. Male and female animals 
were used in equal numbers for all experiments. The University of Pennsylvania 
Institutional Animal Care and Use Committee approved all animal protocols, and 
all procedures were performed in accordance with these protocols. 

Induction of the neonatal CCM mouse model. Pups were intragastrically injected 
with 25 1g 4-hydroxytamoxifen (Sigma Aldrich H7904) at intervals of 1, 3 and 5 days 
after birth. The 4-hydroxytamoxifen was dissolved in warm 10% ethanol/corn oil 
vehicle, and 50 11 total volume was used per injection and delivered with a 30-gauge 
needle. Importantly, pups were injected in a blinded fashion without knowledge of 
genotypes, and there were no phenotypic indications within the first 5 days after 
birth. Pups were collected at specified time points 6 or 11 days after birth. 
Mouse embryos for branchial arch artery rescue. Embryos were collected at 
embryonic day 10.5 (E10.5), gross images were taken, and embryos were placed 
in 4% paraformaldehyde overnight. Samples were processed for histology as 
described. 

Histology. Embryos or mouse brains were fixed in 4% formaldehyde overnight, 
dehydrated in 100% ethanol, and embedded in paraffin. Sections (8-jum thick) were 
used for immunohistochemistry and haematoxylin and eosin staining. K/f2 in situ 
hybridization was performed as previously reported*”. The following antibodies 
were used for immunostaining: rat anti-PECAM (1:500, R&D MAB3628), rat 
anti-PECAM (1:20, Histo Bio Tech DIA-310), rabbit anti-versican (1:200, Millipore 
AB1033), rabbit anti- DPEAAE (1:200, Pierce- Antibodies PA1-1748A), mouse anti- 
KLF4 (1:100, R&D AF3158), rabbit anti-pSMAD3 (1:25, Abcam ab52903), rabbit 
anti-pMLC2 (1:200, Cell Signaling 3674S). 

Isolation of cerebellar endothelial cells. Cerebellar endothelial cells were isolated 
by enzymatic digestion followed by separation using magnetic-activated cell sorting 
(MACS MS system, Miltenyi Biotec). Mice were first anaesthetized with Avertin 
(Sigma Aldrich T48402) and perfused with sterile PBS. Cerebella of the mice were 
then digested by 1mg ml! collagenase/dispase (Sigma) and 0.02mgml~! DNase I 
(Sigma) in complete DMEM for 10 min at 37°C with gentle shaking. The digestion 
was then passed through a 70-\1m cell strainer (BD Biosciences). Cells were centri- 
fuged, resuspended and incubated with anti-mouse CD31-antibody-conjugated 
microbeads for 15 min at 4°C. Microbead-bound cells were then washed and sep- 
arated using MACS MS columns according to vendor protocol. Cells bound to 
the magnetic column were eluted and centrifuged for downstream applications, 
including western blotting and qPCR analysis. 

Western blotting of protein from isolated endothelial cells. Protein of freshly 
isolated cerebellar endothelial cells was purified using RIPA buffer with complete 
protease inhibitor cocktail (Roche) and PhosSTOP phosphatase inhibitor cocktail 
(Roche). The following antibodies were used for immunoblotting: rabbit anti- 
GAPDH (1:5,000, Cell Signaling 2118), rabbit anti-ADAMTS4 (1:1,000, Abcam 
ab28285), rabbit anti-SMAD3 (1:1,000, Cell Signaling 9513), rabbit anti-pSMAD3 
(1:1,000, Cell Signaling C25A9), rabbit anti-GFP (1:1,000, Abcam ab290). 

Gene expression analysis. Total RNA of isolated endothelial cells was isolated 
with that RNeasy micro kit (Qiagen). For qPCR analysis, cDNA was synthe- 
sized from 150 ng total RNA using the Superscript III Reverse Transcriptase 
(Invitrogen). Real-time PCR was performed with Power SYBR Green PCR 
Master Mix (Applied Biosciences) using the following mouse primers: 
Gapdh forward 5'‘-AAATGGTGAAGGTCGGTGTGAACG-3’, Gapdh reverse 
5'-ATCTCCACTTTGCCACTGC-3’; KIf2 forward 5’-CGCCTCGGGTT 
CATTTC-3’, Kif2 reverse 5‘-AGCCTATCTTGCCGTCCTTT-3’; Kif4 forward 
5'-GTGCCCCGACTAACCGTTG-3’, K/f4 reverse 5’-GTCGTTGAACTC 
CTCGGTCT-3’; Adamts1 forward 5’-CTCTCACCCTTCGGAATTTCTG-3’, 
Adamts1 reverse 5'‘-GGAGCCACATAAATCCTGTCTG-3’; Adamts4 forward 
5’-CAGTGCCCGATTCATCACT-3’, Adamts4 reverse 5’-GAGTCAGGACCG 
AAGGTCAG-3’; Adamts5 forward 5'‘-CGACCCTCAAGAACTTTTGC-3’, 
Adamts5 reverse 5'‘-CGTCATGAGAAAGGCCAAGT-3’; Adamts9 
forward 5‘-TTGGGACCTGCTCAAGAACG-3’, Adamts9 reverse 5/-ACCATTGAT 
GTTGAAGTGTTTGC-3; Axin2 forward 5‘-CAGCCCTTGTGGTTCAAGCT-3’, 
Axin2 reverse 5‘-GGTAGATTCCTGATGGCCGATGT-3’; Lefl forward 
5/-TAACGAGTCCGAAATCATCCCAGC-3’, Lefl reverse 5'-TTCATCAGG 
GTGTTCTCTGGCCTT-3’; Hes1 forward 5’-AAAGCCTATCATGGAGA 
AGAGGCG-3’, Hes! reverse 5’/-GGAATGCCGGGAGCTATCTTTCTT-3'; Heyl 
forward 5'’-GCGCGGACGAGAATGGAAA-3’, Hey1 reverse 5'-TCAGGTG 
ATCCACAGTCATGTG-3’; Hey2 forward 5‘-AAGCGCCCTTGTGAGGAAAC-3’, 
Hey2 reverse 5'-GGTAGTTGTCGGTGAATTGGAC-3’. 
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X-ray microCT-based quantification of neonatal, cerebellar lesions. Eleven- 
day-old pups were anaesthetized with Avertin and underwent intracardiac 
perfusion with PBS and 2% paraformaldehyde. Brains were quickly dissected and 
fixed overnight in a 2.5% glutaraldehyde, 4% formaldehyde, 0.1 M sodium phos- 
phate buffer. Then, brains were rinsed several times with 0.1 M sodium phosphate 
buffer and hindbrains were detached from mid/forebrain structures by severing the 
pons. Next, hindbrains were soaked overnight in 2% osmium tetroxide solution 
(tissue contrast agent) and washed several times with water. 

Hindbrains were randomized and scanned by a blinded operator using Xradia 
MicroCT system (Xradia MicroXCT-400). Images were acquired at 50kV, 10 W, 
721 projections, 3-s integration per 180° rotation. 

Raw image stacks from each scanned hindbrain were analysed using Avizo 3D 
image processing software (Lite edition, FEI Visualization Sciences Group). Each 
image stack was imported and lesions were labelled in semi-automated, blinded 
fashion using a region-growing segmentation algorithm included with the soft- 
ware. Labelled lesions could then be analysed for pixel volume and converted to 
cubic micrometres. Importantly, to prevent measurement bias, a single individual, 
without direct experimental involvement or knowledge of genotypes, was used to 
label lesions of randomly ordered brains. This individual was trained before begin- 
ning analysis of the hindbrains. New-user accuracy was confirmed by comparison 
of results from a test brain analysis with the results from an experienced user. 
A flow-chart of this process is shown in Extended Data Fig. 9. 

Post-lesion labelling, the hindbrain image stack was volume rendered and 
over-laced with the labelled lesions in the Avizo 3D environment. Orientation, 
projection depth and shadow/lighting effects of each volume render were adjusted 
for publication quality images. Importantly, these changes were made post-lesion 
labelling, and did not affect lesion detection or their visual presentation. 

We blinded samples at three distinct points in the analysis. First, neonatal CCM 
model pups were injected with 4-hydroxytamoxifen without knowledge of geno- 
types. Second, hindbrains from genotyped animals were given randomized labels 
to provide for blinded microCT scanning by an independent operator. Third, 
randomized microCT image stacks were analysed in a blinded manner by a single 
operator not involved in any prior experimental steps. 
cDNA constructs and transient expression in HEK293T cells. The human 
KRIT1-Flag expression plasmid was a gift from D. Marchuk*. Human CCM2- 
Flag, PDCD10-Flag and MEKK3-Flag expression plasmids were purchased from 
Origene (RC201418, RC200235 and RC210317). Further epitope tag modifications 
and CCM2 truncations were generated by PCR and conventional cloning meth- 
ods. The CCM2 CCCTdup mutant was generated by site-directed mutagenesis to 
generate the exact four-nucleotide duplication (Agilent 200521). One microgram 
of each construct was transfected in HEK293T cells using Fugene 6 (Promega 
E2691), and cells were collected 48h later with gentle lysis buffer (Life Tech 87787) 
supplemented with protease and phosphatase inhibitors (Roche 11873580001 and 
04906845001). 

Co-immunoprecipitation and western blotting. Anti-HA antibody (51g per 
co-immunoprecipitation (co-IP), Sigma H3663) conjugated to Protein G 
Dynabeads (50,11 per co-IP, Life Technologies 10003D) was used for all exper- 
iments. Protein (250\1g) from HEK293T cells was used for each co-IP with an 
incubation period of 30 min at room temperature. HEK293T cells were obtained 
directly from the ATCC, authenticated and tested negative for mycoplasma. 
Anti-Flag (1:5,000, Santa Cruz sc-807) and anti-HA (1:5,000, Santa Cruz sc-805) 
antibodies were used for western blotting and re-probed with anti-GAPDH 
antibody (1:10,000, Cell Signaling 2118). For co-IP blots of PDCD10-Flag, horse- 
radish peroxidase (HRP)-conjugated anti-Flag antibody (1:2,000, Sigma A8592) 
was used to avoid antibody light chain detection. 

Expression of tetracycline-inducible CCM2 CCCTdup lentivirus in cultured 
endothelial cells. Amino-terminal, HA-epitope tagged, human CCM2 CCCTdup 
was cloned into the pLVX-TRE3G lentiviral vector (Takara Clontech), and lenti- 
virus was generated from co-transfection with packaging plasmids in HEK293T 
cells as previously described'?. HUVECs were co-infected with CCM2 CCCTdup 
along with Tet-On 3G lentiviruses and varying doxycycline amounts were added 
as previously described!”. Cells were collected 48h after doxycycline addition. 
Collected cells were pelleted, resuspended, then divided in half. One half was 
used for total RNA isolated by TRIzol (Life Technologies), and the other half was 
used for protein extracted by gentle lysis buffer as described above. cDNA was 
generated from 500 ng total RNA using the Superscript III Reverse Transcriptase 
(Invitrogen). mRNA levels of the CCM2 CCCTdup lentivirus were assessed by 
qPCR using both N-terminal and C-terminal human CCM2 qPCR primers as 
follows: CCM2 N-term forward 5’-CATACCAGGATACCTGAATCCCT-3’, 
CCM2 N-term reverse 5’-AGCTTGACGT TGTACGCAGAC-3’; CCM2 C-term 
forward 5‘-GCCTCTATCCACGAGTTCTGCA-3’, CCM2 C-term reverse 
5/-AAGTGCTGGCTGTCCTTCTCAG-3’. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Western blotting was performed as previously described with 751g protein from 
HUVEC lysate for each condition. 
siRNA knockdown and adenoviral overexpression in cultured endothelial 
cells. siRNA experiments in HUVECs were performed as previously described!”. 
HUVECs from Lonza were authenticated and tested negative for mycoplasma. siR- 
NAs directed against KRIT1 (s2510, Invitrogen), KLF2 (s20270, Invitrogen), KLF4 
(s17793, Invitrogen), STK24 (s15993, Invitrogen) or STK25 (s20570, Invitrogen) 
were used for the knockdown experiments. For overexpression studies, human 
microvascular endothelial cells (HMVECs) were infected with adenovirus encod- 
ing either mouse K/f2 (Penn Vector Core), human KLF4 (Vector BioLabs), or a 
LacZ control (Penn Vector Core). HMVECs from Lonza were authenticated and 
tested negative for mycoplasma. Cells were collected 48 h after infection, and total 
RNA was isolated using TRIzol Reagent (Life Technologies). cDNA was generated 
from 1 j1g total RNA using SuperScript First-strand Synthesis System (Invitrogen) 
and real-time qPCR was performed using Power SYBR Green PCR Master Mix 
(Applied Biosciences). 

Human primers for qPCR: KLF2 forward 5’‘-CTACACCAAGAG 
TTCGCATCTG-3’, KLF2 reverse 5’-CCGTGTGCTTTCGGTAGTG-3’; KLF4 
forward 5’‘-AGAGTTCCCATCTCAAGGCA-3’, KLF4 reverse 5’-GTCAGTTC 
ATCTGAGCGGG-3’; ADAMTS4 forward 5’-CTGACTTCCTGGACAATGGC-3’, 
ADAMTS4 reverse 5'‘-GCGGTCAGCATCATAGTCCT-3’. 

Drug treatments in cultured endothelial cells with loss of KRIT1. After siRNA 
knockdown of KRIT1, HUVECs were treated with BIX02189 (101.M), XMD17- 
109 (11M), hydroxyfasudil (10 uM), tempol (101M) or vitamin D3 (10jM) for 
24h before collection. 

KLF2 and KLF4 immunostaining in human CCM lesions and control brain 
sections. Sections of human samples were obtained from CCM lesions resected 
at University of Chicago using Institutional Review Board approved protocols. 
Control samples were obtained from two autopsy subjects. Rabbit anti- KLF2 
(1:200, Abcam ab203591), rabbit anti- KLF4 (1:50, Cell Signaling 4038S) and 


mouse anti-PECAM-1 (1:1,000, Cell Signaling 3528S) were used for immuno- 
staining. 

Patient information pertaining to the human CCM samples studied is summa- 
rized below. KRIT1 familial: female, age 11, with the CCM1 Common Hispanic 
Mutation, removed owing to CCM haemorrhage. PDCD10 familial: female, age 19, 
PDCD10 474+5G>A mutation, removed owing to CCM haemorrhage. Sporadic 
1: female, age 24, solitary sporadic lesion, removed owing to headaches and lesion 
growth. Sporadic 2: male, age 31, solitary sporadic lesion, removed owing to seizures. 
Statistics. Sample sizes were estimated based on our preliminary findings for 
rescue of the CCM phenotype. These were estimated from visual analysis of the 
hindbrains of P11 animals from which lesion numbers and sizes can be directly 
assessed, an observation corroborated by microCT-based volume rendering of 
lesions. These preliminary assessments suggested that there were very large differ- 
ences between genetic rescue and non-rescue samples (conservatively 80% rescue) 
that would allow statistical interpretation with relatively small n values. All exper- 
imentals and controls were littermates, and none was excluded from analysis. 
P values were calculated using an unpaired two-tailed Student's t-test, analysis of 
variance (ANOVA), or chi-square analysis as indicated. The mean and s.e.m. are 
shown in the bar graphs. 
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Extended Data Figure 1 | Histological characteristics of early CCM 
lesions and the cerebellar white matter in which they form. a, P7 and 
P8 CCM lesions in the Krit1 model. Hindbrains from Krit1®°*° mice 
were collected at P7 and P8, and sections were stained with haematoxylin 
and eosin to detect vascular malformations. Images are representative 

of 4 independent experiments. Dotted lines indicate the cerebellar white 
matter; arrows indicate CCM lesions. b, Versican is abundant in the white 
matter of the P7 hindbrain, the site of primary CCM formation in the 
neonatal mouse model. Haematoxylin and eosin staining of the hindbrain 
at low (top left) and high (bottom left) magnifications. Antibody staining 
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reveals abundant versican protein in the white matter of the hindbrain 
(centre), the site at which CCM lesions specifically appear at this time 
point in the Krit1 mouse model, and less abundant versican in the cortex. 
DPEAAE staining reveals a pattern of ADAMTS-mediated versican 
degradation that is the inverse of intact versican; that is, higher in the 
cortex and lower in the white matter (right). Results are representative of 

3 independent experiments. Scale bars, 500 1m (top) and 100|1m (bottom). 
Boxes indicate regions shown at higher magnification below; dotted lines 
indicate the cerebellar white matter. 
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Extended Data Figure 2 | Endothelial Rho activity, but not 3-catenin or are representative of 3 independent experiments. d, qPCR analysis of 


SMAD3 signalling, is increased during CCM formation. 8-catenin target genes in hindbrain endothelial cells (n= 4-5). P>0.05 

a, Immunostaining for the endothelial marker PECAM and pMLC in the (unpaired two-tailed Student’s t-test) for comparison of all values. Error 
white matter of the cerebellum of P6 control and Krit1®°*° littermates. bars indicate s.e.m. e, Immunostaining for phosphorylated-SMAD3 

Scale bars, 50|1m. b, Anti-GFP immunostaining to detect TCF/Lef:H2B- (pSMAD3) and PECAM. Scale bars, 501m. f, Total SMAD3 and psMAD3 
GFP Wnt/®-catenin reporter (WntREP) activity. Scale bars, 50 jum. were detected by immunoblotting cerebellar endothelial cell lysates. 

c, Immunoblotting of P6 brain endothelial cell lysate for GFP. Results Results are representative of 3 independent experiments. 
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Extended Data Figure 3 | Hes1 expression rises after primary CCM 
lesion formation in Krit1®©*° hindbrain endothelial cells, and is 
reversed by Map3k3 haploinsufficiency. a, qPCR analysis of Notch target 
genes in endothelial cells freshly collected from the hindbrain of P6 or P11 
Krit1EC®° animals compared with control, tamoxifen-treated littermates 
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(n=4). **P <0.01 (unpaired two-tailed Student's t-test). b, qPCR 
analysis of gene expression in endothelial cells freshly collected from 
the cerebellum of P11 Kritl' and Krit1®C*° animals, with and without 
endothelial loss of one Map3k3 allele (n= 4). **P < 0.01 (unpaired two- 
tailed Student's t-test). Error bars indicate s.e.m. 
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Extended Data Figure 4 | Map3k3 haploinsufficiency rescues both mid-gestation (right). Note the presence of pericardial oedema due to a 
early embryonic lethality due to endothelial loss of Krit1 and postnatal persistent cardiac defect in the Tie2°";Krit", Map3k3* embryo. 
CCM lesion formation due to endothelial loss of Cem2. a, Partial loss Images are representative of 3 independent experiments. b, Rescue of 
of MEKK3 rescues early lethality associated with endothelial deletion CCM formation in the Ccm2 model with loss of MEKK3. iECre;Ccm2", 
of Krit1. E10.5 littermate embryos are shown at the same magnification. Map3k3*!* (Cem2®CX°) and iECre;Ccm2"; Map3k3!”* neonates were 
Tie2“*;Krit “" animals lacking endothelial KRIT1 die by E9.5 due to lack induced with tamoxifen at P1 and lesion formation scored by visual 
of patent branchial arch arteries (middle). By contrast, Tie2"; Kritu™'; detection in the hindbrain at P11 as done for Krit1®*° animals. Images 
Map3k3"* animals develop patent arteries and survive past are representative of 4 independent experiments. Scale bars, 1mm. 
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Extended Data Figure 5 | Rescue of CCM formation in Krit1®“*° but venous dilatation was evident (arrow). Images are representative of 
animals with homozygous loss of KIf2. Haematoxylin and eosin 3 independent experiments. Dotted lines indicate the cerebellar white 


sections through the hindbrain of P11 iECre;Kritl"KIf2" and control matter. Scale bars, 500 jum (left), 100 jum (middle) and 50 jm (right). 
iECre;Krit 1" animals are shown. No true CCM lesions were observed 
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Extended Data Figure 6 | CCM2 binds MEKK3 via its C-terminal 
helical harmonin domain. a, MEKK3 binds the C-terminal domain of 
CCM2. Top, schematic of the CCM2 protein domains. Bottom, series 

of N-terminal-truncated CCM2-HA proteins used to map the CCM2- 
MEKK3 binding region. MEKK3-Flag and the indicated N-terminally 
truncated CCM2-HA proteins were co-expressed in HEK293T cells before 
immunoprecipitation with anti-HA antibodies and immunoblotting 

with anti-Flag antibodies to detect associated MEKK3 proteins (top). 
Immunoprecipitated HA-CCM2 was detected with anti-HA antibodies 
(below). b, MEKK3-Flag and the indicated C-terminally truncated 
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CCM2-HA proteins were co-immunoprecipitated to map the CCM2- 
MEKK3 binding region. c, CCM2 CCCTdup protein is stably expressed in 
cultured endothelial cells. CCM2 CCCTdup-HA protein was expressed 
in cultured HUVECs using a tetracycline-inducible lentiviral vector 

at varying doses of doxycycline. The total (that is, endogenous CCM2 
plus viral CCM2 CCCTdup) CCM2 mRNA levels were measured using 
qPCR (bottom), and CCM2 CCCTdup protein detected using anti-HA 
immunoblotting (top). GAPDH immunoblots are shown for loading 


control. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


KEE 


— _ Co Scr. 


_ 1 GS sikif2 
(eo) i 
@ Mm siKif4 
© 1.0 
z 
Lu 
® 
2 05 
& 
() 
wd 
0) 
e& gg ¢ 
& 


b KXEK 


20 
= - CJ Ad-LacZ 
= 15 
72) 
@ 10 HXKK eS Ad-klf2 
Z 6 = 
ui 4 n 
od) 
2 3 
: : y 
{) 
w 1 
¢) 


Cc 40 
5 CJ Ad-LacZ 
D 
35 
2 vex MD Ad-Kifa 
= 
(ok 
Wo os 
Oo 4 
= 8 
— 
IS: 2 
2 4 
a 
0) 
KL S b\ 
\ a 
@ Co 
Extended Data Figure 7 | The versican-degrading ADAMTS proteases to expression following exposure to control adeno-LacZ virus. 
are regulated by KLF2 and KLF4. a, siRNA knockdown of KLF2 or KLF4 c, Adenoviral expression of KLF4 drives expression of ADAMTS4 in 
in cultured HUVECs reduces ADAMTS4 expression, as measured by addition to eNOS. KLF2 levels are reduced in KLF4-overexpressing 
qPCR. b, Adenoviral expression of KLF2 drives expression of the versican- | HUVECs. Results are shown relative to expression after exposure to 
degrading proteases ADAMTS1, ADAMTS4 and ADAMTS9 in addition to control adeno-LacZ virus (n = 4-5). *P < 0.05; **P< 0.01; ***P< 0.001 
the known KLF2/4 target gene eNOS (also known as NOS3). KLF4 levels (unpaired two-tailed Student's t-test). Error bars indicate s.e.m. 


are reduced in KLF2-overexpressing HUVECs. Results are shown relative 
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Extended Data Figure 8 | The MEKS inhibitor BIX02189 and the or without BIX02189 (101M) and XMD17-109 (111M) (MEK5 and ERK5 
ERKS inhibitor XMD17-109 reverse the increase in KLF2 and KLF4 inhibitors, respectively) for 24h, and the levels of KLF2 and KLF4 mRNA 
expression conferred by loss of KRIT1 in cultured endothelial cells. were measured by qPCR (n= 3). **P < 0.01; ***P < 0.001 (unpaired two- 
HUVECs were treated with scrambled or KRIT1-targeted siRNAs with tailed Student's t-test). Error bars indicate s.e.m. 
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Extended Data Figure 9 | Imaging and volume measurement of mouse hindbrain CCM lesions using microCT. The imaging of P11 mouse hindbrains 
and analysis of the raw data to create composite CCM lesion images and volumes are shown. 
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Structural basis of lenalidomide-induced CKla 
degradation by the CRL4©®®" ubiquitin ligase 


Georg Petzold!”, Eric S. Fischer)?+ & Nicolas H. Thoma!? 


Thalidomide and its derivatives, lenalidomide and pomalidomide, 
are immune modulatory drugs (IMiDs) used in the treatment of 
haematologic malignancies’”. IMiDs bind CRBN, the substrate 
receptor of the CUL4-RBX1-DDB1-CRBN (also known as 
CRL4©®29) £3 ubiquitin ligase*, and inhibit ubiquitination of 
endogenous CRL4°®®N substrates*. Unexpectedly, IMiDs also 
repurpose the ligase to target new proteins for degradation. 
Lenalidomide induces degradation of the lymphoid transcription 
factors Ikaros and Aiolos (also known as IKZF1 and IKZF3)°~’, 
and casein kinase 1a (CK1a)°, which contributes to its clinical 
efficacy in the treatment of multiple myeloma’ and 5q-deletion 
associated myelodysplastic syndrome (del(5q) MDS), respectively. 
How lenalidomide alters the specificity of the ligase to degrade 
these proteins remains elusive. Here we present the 2.45 A crystal 
structure of DDB1-CRBN bound to lenalidomide and CK1a. CRBN 
and lenalidomide jointly provide the binding interface for a CKl1a 
8-hairpin-loop located in the kinase N-lobe. We show that CKla 
binding to CRL4°®® js strictly dependent on the presence of an 
IMiD. Binding of IKZF1 to CRBN similarly requires the compound 
and both, IKZF1 and CK1q, use a related binding mode. Our study 
provides a mechanistic explanation for the selective efficacy of 
lenalidomide in del(5q) MDS therapy*”. We anticipate that high- 
affinity protein-protein interactions induced by small molecules 
will provide opportunities for drug development, particularly for 
targeted protein degradation. 

Thalidomide, lenalidomide and pomalidomide are used in the treat- 
ment of multiple myeloma'”. In del(5q) MDS patients, lenalidomide 
shows pronounced clinical efficacy compared to thalidomide and 
pomalidomide”*!°, which has been linked to induced degradation of 
CK1a (ref. 8). Del(5q) MDS is a haematologic disorder in which dele- 
tion of the long arm of chromosome 5 eliminates one casein kinase 1 
alpha allele (CSNK1A1). Heterozygous deletion of CSNK1A1 causes 
haematopoietic stem cell hyperproliferation, whereas homozygous loss 
of both alleles results in stem cell failure and apoptosis'!. CSNKIA1 
haploinsufficiency thus sensitizes malignant del(5q) MDS cells to 
lenalidomide-induced CK1a degradation, while wild-type cells remain 
unaffected®, 

To characterize the binding of CK1a to CRL4“884 in vitro, we 
reconstituted the complex and measured the affinity of neddylated 
CRL4CBEN (veCRL4ER®N) to CK 1a using time-resolved fluorescence 
resonance energy transfer (TR-FRET) (Extended Data Fig. 1a, b; see 
Methods). At saturating lenalidomide concentrations (Extended Data 
Fig. 1c), nsCRL4°"®N and CK1a tightly associate with an apparent 
dissociation constant (Kp*??) of 242 + 15nM (Fig. 1a and Extended 
Data Fig. 1d). In contrast, nsCRL4@8"N did not bind CK1a in the 
absence of lenalidomide, or when rendered IMiD-binding deficient 
(IBD) by mutations in the IMiD-binding domain of CRBN (Fig. la and 
Extended Data Fig. le). In the presence of IMiD analogues, pomalido- 
mide and thalidomide, ysCRL4-°5N binding to CK1a was weakened 


~2-fold (Kp*??(pomalidomide) = 608 + 60nM) and ~5-fold 
(Kp’??(thalidomide) = 1,122 + 109nM) compared to lenalidomide 
(Fig. 1b and Extended Data Fig. 1c). 

To understand the molecular basis of CK1a degradation in del(5q) 
MDS, we pursued the high-affinity complex of CRL4C88N, CK1a and 
lenalidomide for structural studies. The modular CRL4©®®N assembly 
adopts a U-shaped architecture’, in which the DDB1 adaptor protein 
mediates interactions between CRBN and the CUL4-RBX1 ligase 
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Figure 1 | Overall structure of the DDB14®?8_CRBN-lenalidomide-CK1a 
complex. a, TR-FRET. Wild-type (WT) or IMiD binding deficient (IBD) 
mutant ygCRL4°®® titrated to CK1a in the absence or presence of 541M 
lenalidomide (L). b, TR-FRET. ysCRL4°®®N—-CK1« complex formation in 
the absence (DMSO) or presence of 5 1M lenalidomide, pomalidomide 

or thalidomide. TR-FRET data in this figure are biological replicates 
presented as means + s.d. (n =3). c, Cartoon representation of DDB14??5_ 
CRBN-lenalidomide-CK1a. Top left, DDB1, CRBN and CK1a domain 
colouring and boundaries. Hatched lines indicate sequences omitted in the 
crystallized proteins. Bottom right, close-up of the CRBN-lenalidomide- 
CK1a interface. Composite omit 2F, — F, map contoured at 2.00. 
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Figure 2 | CK1a interacts with CRBN and lenalidomide. 

a, Superposition of CRBN-lenalidomide-CK1a and human CRBN 
bound to lenalidomide (PDB entry 4TZ4). b, Chemical structures 

of lenalidomide, pomalidomide and thalidomide. Arrows indicate 

C3 position of the phthalimide ring. c, CRBN binds the glutarimide 
moiety in a hydrophobic pocket. The phthalimide ring of lenalidomide 
is surface exposed. A B-hairpin loop of CK1a binds CRBN on top of 
its lenalidomide-binding pocket. d, Side-chain interactions between 
CK1a, CRBN and lenalidomide. Lenalidomide provides van der Waals 
interactions with CK1a. CRBN residues that form the hydrophobic pocket 
are shown in grey. Dashed lines indicate hydrogen bonds. 


arm. This arrangement juxtaposes the CRBN substrate receptor 
to RBX1, the subunit that recruits the activated E2-Ub to the com- 
plex. DDB1 is composed of three 3-propeller domains (BPA, BPB, 
BPC). Only two B-propellers (BPA, BPC) are necessary and suffi- 
cient for CRBN binding*!?. The BPB domain interacts with CUL4 
and was removed to facilitate crystallization (DDB14??8), Crystals 
were obtained for the 178-kDa DDB14?8_CRBN-lenalidomide- 
CK1a complex and its structure was determined to 2.45 A resolution 
(Extended Data Table 1). The unit cell of the crystal contained two 
identical complexes, one of which refined to considerably lower B 
factors (chain B and chain C; Extended Data Fig. 2a—c). 

We find that CK1a exclusively interacts with the C-terminal 
domain of CRBN (CTD; residues 317-442), which accommo- 
dates lenalidomide in a hydrophobic pocket* (Fig. 1c). CRBN 
and lenalidomide together provide the interaction surface for 
CK1a (CRBN-CK1a, 718.1 A?; CRBN-lenalidomide, 236.6 A’; 
lenalidomide-CK1a, 96.8 A’; total, 1,051.5 A). No contacts are 
observed between CK1a and DDB1. The kinase domain of CK1a 
adopts an active conformation”, although the nucleotide binding 
pocket is not occupied by ATP (Extended Data Fig. 3). CK1a binding 
repositions the phthalimide ring of lenalidomide 2.5 A towards CRBN 
residue Glu377, which resides in the loop linking 6-stands 12 and 13 
(Fig. 2a). This shifts the lenalidomide C3 phthalimide to the backbone 
carbonyl of Glu377. Thalidomide and pomalidomide carry a carbonyl 
group at their C3 phthalimide position (Fig. 2b) that would result in 
clashes with the CRBN backbone (Extended Data Fig. 4). This obser- 
vation provides an unanticipated rationale for the preference of CK1a 
for lenalidomide during complex formation with CRL4C®®N (Fig. 1b), 
and explains how a buried carbonyl, absent in lenalidomide and distant 
from the IMiD-CK1a interface, contributes to target specificity. 

CK1a binds CRBN-lenalidomide via a 3-hairpin loop (residues 
35-41) in the kinase N-lobe that connects the CK1a 6-strands 2 and 3 
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Figure 3 | CK1a and IKZF1 share a common CRBN interface. 

a, Schematic CRBN-lenalidomide-CK1a interface. b, Wild-type or 
mutant ygCRL4C®®N titrated to CK1« in the presence of lenalidomide. 

¢, nsCRL4°8N titrated to mutant CK1« in presence of lenalidomide (L). 
d, Unlabelled IKZF1®-19°39-255 (0,02-20 1M) titrated to pre-assembled 
nsCRL4C®®N_lenalidomide-CK1a complexes (500 nM ngCRL4CB8N, 
100nM CK 1a, 5M lenalidomide; blue graph). ngCRL4CRBN_ 
lenalidomide-CK1a (green). ygCRL4°®®N, CK1« without lenalidomide 
(red). e, Wild-type or mutant ysCRL4“ titrated to wild-type IKZF11-19939-255 
in the presence of pomalidomide. TR-FRET data (b-e) are biological 
replicates presented as means + s.d. (n = 3). f, CRBN surface contacted by 
CK1la and IKZF1. 


(Fig. 2c). Residues within this loop interact with the otherwise 
solvent-exposed phthalimide ring of lenalidomide and contact CRBN 
residues that surround the lenalidomide binding pocket (Fig. 2d and 
Fig. 3a). Mutation of CRBN residues His353 to aspartate and Tyr355 
to alanine did not prevent IMiD binding (Extended Data Fig. le), but 
affected CK1a complex formation (Fig. 3b and Extended Data Fig. 4f). 
The side chain of CK1a Ile35 provides hydrophobic contact with C5 
and C6 of the lenalidomide phthalimide ring and CK1a Gly40 is sand- 
wiched between phthalimide C7 and the side chain of CRBN Val388 
(Figs 2d and 3a). Sequence alignments between casein kinase 1 and 2 
isoforms reveal substitution of Gly40 for a bulky asparagine residue 
in CK2a (Extended Data Fig. 5a, b), and mutation of CK1a Gly40 to 
asparagine abolished CRBN binding (Fig. 3c). Consistently, levels of 
CK2« are not reduced in cells treated with lenalidomide (Extended 
Data Fig. 5c). Together, these observations emphasize the importance 
of the 6-hairpin residue Gly40 for CK1 a degradation. 

The side chain of CRBN Val388 is in close proximity to the CK1a 
residues Gly40 and Glu41. Val388 is substituted for isoleucine in CRBN 
from chicken (Ile390) and mouse (Ile391; Extended Data Fig. 5d, e). 
Mutation of Val388 to isoleucine in human CRBN led to an 
~4-fold reduction of the apparent Kp (1,095 +99 nM; Fig. 3b), which 
is in agreement with steric clashes expected upon substitution 
(Extended Data Fig. 5e). Similarly, the Val388Lys mutation rendered 
CRBN unable to interact with CK1a in vitro (Fig. 3b). Mouse cells with 
the naturally occurring CRBN Ile391 are largely resistant to IMiD- 
induced CK1a degradation®. The Ile391Val mutation in mouse CRBN 
alone was sufficient to restore IMiD-induced degradation of CK1a and 
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Anti-CK1o. western blot 
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Figure 4 | CK1c is juxtaposed to the active centre of the CRL4C88N 
ligase. a, In vitro ubiquitination of CK1a by ysCRL4C88N depends on 
lenalidomide. WT, wild-type. IBD, IMiD binding deficient ngCRL4CR8N 
mutant. KO, lysine-less ubiquitin. b, In vitro ubiquitination of CKla by 
nsCRL4°®® in the presence different IMiDs at 350 nM ([IMiD] ~ Kp). 

c, Model of CRL4“®®% bound to CK1a (side view). d, Rotational flexibility 
of the RBX1 subunit upon cullin neddylation and CUL4 ligase arm 
mobility increase the E2 accessible surface on CK1a (top view). Red 
spheres indicate the active site cysteine of the E2. 


that of two additional CRL4C®®N substrates, IKZF1 and IKZE3 (ref. 8), 
which is consistent with our structural and biochemical findings. 

Next we examined the spatial relationship between the binding 
sites of CK1a and the Ikaros family transcription factors on CRBN. 
A truncated form of human IKZF1 (see Methods) bound ygCRL4CR8N 
with a higher affinity (Kp*??(lenalidomide) 101 +2nM) than CKla 
(Extended Data Fig. 6) and efficiently displaced CK1a from CRBN 
(ICs0, 712 + 11 nM; Kj, 89 +3 nM; Fig. 3d). Thus, CK1a and IKZF1 
recruitment is mutually exclusive. We then tested the effects of dif- 
ferent IMiDs on IKZF1 recruitment and observed that lenalidomide 
(Kp*?? = 101 +2nM) and pomalidomide (Kp*?P = 96 + 24nM) medi- 
ated IKZF1 binding to CRBN with comparable affinities, whereas in 
the presence of thalidomide the apparent Kp was reduced ~1.5-fold 
(Kp??P = 155 + 33 nM) (Extended Data Fig. 6a). No binding between 
IKZF1 and CRBN was detected in the absence of IMiDs. Recruitment 
of CKlaand IKZF1 to CRL4©®N thus strictly depends on the presence 
of an IMiD, and occurs through overlapping binding sites on CRBN 
(Fig. 3e, f). 

IKZF1 contains four amino-terminal C2H2-type zinc-fingers!* 
(residues 118-225; Extended Data Fig. 7a). IKZF1 Gln 146 within zinc- 
finger 2 (ZF2), when replaced by histidine, was shown to confer resist- 
ance to IMiD-induced degradation in cells°. IKZF1 ZF2 has strong 
sequence homology to a known zinc-finger structure (PDB 2113) 
(Extended Data Fig. 7b). The B-hairpin loop (residues 146-151) in 
the predicted structure of IKZF1 ZF2 shows striking conformational 
similarity to that of CK1« (residues 35-40; Extended Data Fig. 7c). 
IKZF1 residue Gln146 is part of this loop, and the Gln146His mutation 
impaired binding to CRBN-pomalidomide in vitro (Extended Data 
Fig. 7d). There is little sequence conservation between CK1a and 
IKZF1 with the exception of Gly151 (equivalent to Gly40 in CK1a; 
Extended Data Fig. 7b). Mutation of IKZF1 Gly151 (Gly151Asn) 
or two zinc-coordinating residues (Cys147Ala, His167Ala) in ZF2 
ablate pomalidomide-dependent IKZF1 binding to CRBN, as well as 
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substitution of Leul66 (Leu166Glu; Extended Data Fig. 7d). The addi- 
tional contact mediated by Leul66 seems unique to IKZF1 (Extended 
Data Fig. 7e-i). The overlay of CK1a with IKZF1 ZF2 further predicts 
that DNA bound by IKZF1 would clash with CRBN (Extended Data 
Fig. 8a). In electrophoretic mobility shift assays and TR-FRET experi- 
ments, we observed that recruitment to DDB1-CRBN-pomalidomide 
is impaired when IKZF1 is engaged with its consensus DNA (Extended 
Data Fig. 8b-d). Thus, our results strongly suggest that IKZF1 uses 
a structurally similar 3-hairpin loop containing the glycine signature 
for binding the CRBN-IMiD interface, which explains mutual exclu- 
sive binding of IKZF1 and CK1a. Additional zinc-finger proteins were 
found downregulated in the presence of lenalidomide (for example 
ZNF692, ZFP91) (ref. 8), and these carry similar structural motifs. 

Lenalidomide recruits CK1a to CRL4°®®N, which results in CK10 
degradation in MDS-L cells*. Experiments in a fully reconstituted 
system further demonstrate that CK1« binding to ygCRL4C®® is 
sufficient to drive its ubiquitination in vitro (Fig. 4a). In case of lenalido- 
mide, high-affinity CK1a binding leads to polyubiquitination (Fig. 4b), 
whereas weaker binding in the presence of pomalidomide and 
thalidomide (Fig. 1b and Fig. 2a, b) results in reduction of higher-order 
ubiquitin conjugates. This difference is only evident at non-saturating 
IMiD concentrations (Fig. 4b and Extended Data Fig. 9a). We expect 
this to be the case also in cells owing to the reported low solubility 
and permeability of IMiDs!*~'®. Assuming comparable concentrations 
of the three IMiDs in cells, differences in CK1a binding coupled to 
processive ubiquitination may account for the strong cellular prefer- 
ence for lenalidomide over pomalidomide and thalidomide in targeting 
CK 1a for degradation’. 

Given the modular nature of CRL4 ligases!?°, the DDB145?3_ 
CRBN-lenalidomide-—CK1a complex can be combined with existing 
DDB1-CUL4-RBX1 structures into a model of the entire CRL4C82N— 
lenalidomide-CK1« assembly (Fig. 4c). The unique flexibility of the 
CULA4 ligase arm combined with unhooking of RBX1 upon cullin 
neddylation”! creates an exceptionally large ubiquitination zone’ 
(340A x 110A x 30A) that brings E2-Ub bound to RBX1 into direct 
contact with CK1a (Fig. 4d). Mapping lenalidomide-induced CK1a 
ubiquitination sites identified Lys65 and Lys225 (numbering according 
to CK1a isoform 1) in vivo’ and Lys8, 62 and 179 in vitro (Extended 
Data Fig. 9b). These lysine residues are surface exposed and readily 
accessible to the E2 in our crystallographic model. These results illus- 
trate, for the first time, how a cullin-RING ligase engages a large pro- 
tein substrate, which leads to its ubiquitination by proximity to the E2. 
Numerous viral proteins make use of ubiquitination-by-proximity 
when hijacking the CRL1, CRL2, CRL4 or CRLS5 systems to target 
antiviral host proteins for degradation”*. Similarly, bifunctional 
chemical adaptors based on thalidomide-conjugated ligands (via the 
solvent-exposed phthalimide C4) were able to recruit diverse proteins 
to CRL4C®8N, which lead to their degradation by proximity”>4, 

Interestingly, low molecular weight compounds have been shown 
to modulate protein-protein interactions”® by either reinforcing pre- 
existing interactions”® or inducing de novo complex formation’’. We 
did not detect binding of either IKZF1 or CK1a to CRL4©®8 in the 
absence of IMiDs (Fig. 1b and Extended Data Figs 6a and 8c) and 
therefore, our results support an IMiD-induced de novo interaction 
between CKla, IKZF1 and CRL4°®8N_ Recruitment of the CK1a 
and IKZF1 neo-substrates to CRL4°85N js driven by a surprisingly 
small 96.8 A? compound-protein interface (phthalimide C5-C7) 
that induces high-affinity binding. It is currently unclear if IMiD- 
like metabolites or post-translational modifications exist”® that assist 
recruitment of endogenous substrates through a related mechanism 
(Extended Data Fig. 9c). 

Our work provides the first structural insight into how small 
molecules exploit an ubiquitin ligase for the destabilization of ther- 
apeutic targets, and offers a framework for designing future drugs 
that target proteins for proximity-based CRL4 ubiquitination and 
degradation. 


7 APRIL 2016 | VOL 532 | NATURE | 129 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 17 September 2015; accepted 7 January 2016. 
Published online 24 February 2016. 


1. Singhal, S. et a/. Antitumor activity of thalidomide in refractory multiple 
myeloma. N. Engl. J. Med. 341, 1565-1571 (1999). 

2. Bartlett, J. B., Dredge, K. & Dalgleish, A. G. The evolution of thalidomide and its 
IMiD derivatives as anticancer agents. Nature Rev. Cancer 4, 314-322 (2004). 

3. Ito, T. et al. Identification of a primary target of thalidomide teratogenicity. 
Science 327, 1345-1350 (2010). 

4. Fischer, E. S. et al. Structure of the DDB1-CRBN E3 ubiquitin ligase in complex 
with thalidomide. Nature 512, 49-53 (2014). 

5. Kronke, J. et a/. Lenalidomide causes selective degradation of IKZF1 and IKZF3 
in multiple myeloma cells. Science 343, 301-305 (2014). 

6. Lu, G. et al. The myeloma drug lenalidomide promotes the cereblon- 
dependent destruction of Ikaros proteins. Science 343, 305-309 (2014). 

7. Gandhi, A. K. et a/. Immunomodulatory agents lenalidomide and 
pomalidomide co-stimulate T cells by inducing degradation of T cell 
repressors Ikaros and Aiolos via modulation of the E3 ubiquitin ligase complex 
CRL4CREN. Br. J. Haematol. 164, 811-821 (2014). 

8. Kronke, J. et al. Lenalidomide induces ubiquitination and degradation of CKla 
in del(5q) MDS. Nature 523, 183-188 (2015). 

9. List, A. et al. Efficacy of lenalidomide in myelodysplastic syndromes. N. Engl. 

J. Med. 352, 549-557 (2005). 

10. Sekeres, M. A. & List, A. Lenalidomide (Revlimid, CC-5013) in myelodysplastic 
syndromes: is it any good? Curr. Hematol. Rep. 4, 182-185 (2005). 

11. Schneider, R. K. et a/. Role of casein kinase 1A1 in the biology and targeted 
therapy of del(5q) MDS. Cancer Cell 26, 509-520 (2014). 

12. Chamberlain, P. P. et a/. Structure of the human Cereblon-DDB1-lenalidomide 
complex reveals basis for responsiveness to thalidomide analogs. Nature 
Struct. Mol. Biol. 21, 803-809 (2014). 

13. Kornev, A. P, Haste, N. M., Taylor, S. S. & Eyck, L. F. Surface comparison of 
active and inactive protein kinases identifies a conserved activation 
mechanism. Proc. Nat! Acad. Sci. USA 103, 17783-17788 (2006). 

14. Schjerven, H. et al. Selective regulation of lymphopoiesis and leukemogenesis 
by individual zinc fingers of Ikaros. Nature Immunol. 14, 1073-1083 (2013). 

15. Zhou, S. et al. Transport of thalidomide by the human intestinal caco-2 
monolayers. Eur. J. Drug Metab. Pharmacokinet. 30, 49-61 (2005). 

16. Hofmeister, C. C. et a/. Phase | trial of lenalidomide and CCI-779 in patients 
with relapsed multiple myeloma: evidence for lenalidomide-CCl-779 
interaction via P-glycoprotein. J. Clin. Oncol. 29, 3427-3434 (2011). 

17. Krenn, M., Gamcsik, M. P., Vogelsang, G. B., Colvin, O. M. & Leong, K. W. 
Improvements in solubility and stability of thalidomide upon complexation 
with hydroxypropyl-beta-cyclodextrin. J. Pharm. Sci. 81, 685-689 (1992). 

18. Roche, S. et al. Development, validation and application of a sensitive LC-MS/ 
MS method for the quantification of thalidomide in human serum, cells and 
cell culture medium. J. Chromatogr. B 902, 16-26 (2012). 


130 | NATURE | VOL 532 | 7 APRIL 2016 


19. Fischer, E. S. et al. The molecular basis of CRL4082/CS4 ubiquitin ligase 
architecture, targeting, and activation. Cel! 147, 1024-1039 (2011). 

20. Angers, S. et al. Molecular architecture and assembly of the DDB1-CUL4A 
ubiquitin ligase machinery. Nature 443, 590-593 (2006). 

21. Duda, D. M. et al. Structural insights into NEDD8 activation of cullin-RING 

igases: conformational control of conjugation. Cel/ 134, 995-1006 (2008). 

22. Mahon, C., Krogan, N. J., Craik, C. S. & Pick, E. Cullin E3 ligases and their 

rewiring by viral factors. Biomolecules 4, 897-930 (2014). 

23. Winter, G. E. et a/. Phthalimide conjugation as a strategy for in vivo target 

protein degradation. Science 348, 1376-1381 (2015). 

24. Lu, J. et al. Hijacking the E3 ubiquitin ligase cereblon to efficiently target BRD4. 

Chem. Biol. 22, 755-763 (2015). 

25. Milroy, L. G., Grossmann, T. N., Hennig, S., Brunsveld, L. & Ottmann, C. 

odulators of protein-protein interactions. Chem. Rev. 114, 4695-4748 
(2014). 

26. Tan, X. et al. Mechanism of auxin perception by the TIR1 ubiquitin ligase. 
Nature 446, 640-645 (2007). 

27. Griffith, J. P. et al. X-ray structure of calcineurin inhibited by the immunophilin- 
immunosuppressant FKBP12-FK506 complex. Cell 82, 507-522 (1995). 

28. Hartmann, M. D. et al. Thalidomide mimics uridine binding to an aromatic 
cage in cereblon. J. Struct. Biol. 188, 225-232 (2014). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements This work was supported by the Novartis Research 
Foundation, the Swiss National Science Foundation (31003A_144020) and 
the European Research Council (ERC-2014-ADG 666068 CsnCRL). G.P. was 
supported by Long-Term Fellowships of the European Molecular Biology 
Organization (EMBO; ALTF-1350-2013) and the Human Frontier Science 
Program (HFSP; LT000210/2014). We thank G. M. Lingaraju for the DDB14®P8 
construct, M. Faty for assistance with protein purifications, J. Rabl and S. 
Kassube for help during diffraction data collection, R. Bunker for support 

with data processing and model building, W. A. Rahman and A. Potenza 

for assistance with fluorescence polarization, W. C. Forrester for help and 
comments, and J. Seebacher, R. Sack, D. Hess and M. Stadler for help with mass 
spectrometry experiments. We acknowledge the Paul Scherrer Institute for 
provision of synchrotron radiation beam time at beamline PXII and PXIIl of the 
SLS and would like to thank M. Wang for assistance. 


Author Contributions G.P,, E.S.F. and N.H.T. designed the experiments and 
interpreted the results. G.P. and E.S.F. carried out the experiments. G.P,, E.S.F. 
and N.H.T. wrote the manuscript. 


Author Information Structural coordinates for hsDDB14®°®-hsCRBN- 
lenalidomide-hsCK1a have been deposited in the Protein Data Bank under 
accession number 5FQD. Reprints and permissions information is available 
at www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the 
paper. Correspondence and requests for materials should be addressed to 
N.H.T. (nicolas.thoma@fmi.ch). 


© 2016 Macmillan Publishers Limited. All rights reserved 


METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized, and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Constructs and protein purification. Wild-type and mutant versions of human 
DDB1 (Q16531), human CRBN (Q96SW2), human CK1« (P48729) and human 
IKZF1 (Q13422) were subcloned into pAC-derived vectors”’ and recombinant pro- 
teins expressed as N-terminal Hisg, StrepII or StrepII-Avi fusions in Trichoplusia ni 
High-Five insect cells using the baculovirus expression system (Invitrogen). 
For purification of Hisc-DDB1-Hisg-CRBN, Hisg-DDB1*"?5-StrepII-CRBN“*!*° 
and Hisg-DDB143?5-Hisg-CRBN, cells were resuspended in buffer containing 
50 mM tris(hydroxymethyl)aminomethane hydrochloride (Tris-HCl) pH 8.0, 
200 mM NaCl, 0.25 mM tris(2-carboxyethyl)phosphine (TCEP), 1 mM phenyl- 
methylsulfonyl fluoride (PMSF), 1x protease inhibitor cocktail (Sigma-Aldrich) 
and lysed by sonication. Cells expressing StrepII-Avi-CK1a or truncated versions 
of StrepII-Avi-IKZF1 (A256-519, A197-238/A256-519 and A1-82/A197-238/ 
A256-519; full-length IKZF1 forms aggregates during purification) were lysed 
in the presence of 50mM Tris-HCl pH 8.0, 500mM NaCl, 0.25mM TCEP, 1 mM 
PMSF and 1x protease inhibitor cocktail (Sigma-Aldrich). Following ultracen- 
trifugation, the soluble fraction was passed over Strep-Tactin Sepharose (IBA) 
or His-Select nickel affinity resin (Sigma-Aldrich) and following elution, affinity 
tags were removed from CK1a and IKZF1 by overnight TEV protease treatment 
as indicated. The affinity-purified protein was further purified via ion exchange 
chromatography (Poros 50HQ and 50HS) and subjected to size-exclusion chro- 
matography in 50mM HEPES pH 7.4, 200 mM NaCl and 0.25mM TCEP. The 
protein-containing fractions were concentrated using ultrafiltration (Millipore) 
and flash frozen (DDB1-CRBN constructs at 40-120|1M, TEV cleaved CK1« at 
~280 1M, StrepII-Avi-CK1la at ~50,.M, TEV cleaved IKZF1 at ~350,1M and 
StrepII-Avi-IKZF1 at ~150|.M). Proteins were stored at —80°C. 

Crystallization and data collection. Attempts to crystallize CK1a, lenalidomide 
and CRBN with the full-length DDB1 adaptor protein were unsuccessful. Because 
the WD40 8-propeller B (BPB) of DDB1 is not involved in CRBN binding, we 
generated a human DDB1 construct in which a GNGNSG-linker replaced the 
BPB domain (residues 396-705; DDB14"?8), For crystallization of the DDB1A2PB_ 
CRBN-lenalidomide-CK1« complex, Hiss. DDB145?3_StrepII-CRBN“!” at 
70M was mixed with lenalidomide at 80|1M before the addition of TEV cleaved 
full-length CK1a at 801M. The mixture was incubated on ice for 1h and sub- 
sequently centrifuged at 20,000g for 30 min at 4°C. Crystallization plates were 
set up and stored at room temperature. Crystals appeared within 3 days after 
mixing the protein solution 1:1 with the reservoir containing 70 mM Tris pH 7.0, 
140mM MgCl, and 7% (w/v) PEG 8000 and continued growing until day 13 in 
MRC 2 Well Crystallization format vapour diffusion plates (Swissci). Crystals 
were cryo-protected in reservoir solution supplemented with 20% ethylene glycol 
and flash-cooled in liquid nitrogen. Diffraction data were collected at the Swiss 
Light Source (beamline PXII) with a Pilatus 6M detector at the wavelength of the 
Zn-edge (1.28162 A) and a temperature of 100 K. Data were indexed and integrated 
using XDS* and scaled using AIMLESS supported by other programs of the CCP4 
suite*!, The optimal high-resolution cut-off (2.45 A) was determined based on 
the half-set correlation criterion*” (CCj/2 = 0.45 for the highest resolution shell). 
Data processing statistics, refinement statistics and model quality parameters are 
provided in Extended Data Table 1. 

Structure determination and model building. The DDB14??8_CRBN- 
lenalidomide-CK1« quaternary complex crystallized in space group P1 with two 
complexes in the unit cell. PHASER*? was used to determine the structure by 
molecular replacement using a crystallographic model of DDB1 omitting the BPB 
domain, a CK1a homology model generated with Modeller* based on a CK16 
crystal structure (PDB entry 4TWC), and human CRBN (PDB entry 4TZ4) as 
search models. The initial model was iteratively improved with COOT and refined 
using PHENIX.REFINE* and autoBUSTER™, with ligand restraints generated by 
Grade server (Global Phasing). Figures were generated with PyMOL (The PYMOL 
Molecular Graphics System, Version 1.7.2.2 Schrédinger, LLC) and model quality 
was assessed with MOLPROBITY™~”. Interaction surfaces were determined with 
PISA*®. The IKZF1 homology model was calculated using Modeller based on a 
multiple sequence alignment with the experimental zinc-finger structures of Aart, 
YY1 and Kasio (PDB entries 2113, 1UBD, 2LT7). 

Labelling of NEDD8 with Alexa-488-maleimide. Purified human NEDD8(M1C) 
(NEDD8) was incubated with DTT (8 mM) at 4°C for 1 h. DTT was removed 
using a $200 16/60 gel filtration column in a buffer containing 50mM Tris pH 7.3 
and 150mM NaCl. Alexa-488-C5-maleimide (Invitrogen) was dissolved in 100% 
DMSO and mixed with NEDD8 to achieve fourfold molar excess of Alexa-488-C5- 
maleimide. NEDD8 labelling was carried out at room temperature for 3h ina 
vacuum desiccator and stored overnight at 4°C. Labelled NEDD8 was purified on 
a $200 16/60 gel filtration column in 50 mM Tris pH 7.5, 150mM NaCl, 0.25 mM 
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TCEP and 10% (v/v) glycerol, concentrated by ultrafiltration (Millipore), flash 
frozen (~40-80 1M) in liquid nitrogen and stored at —80°C. 
CRL4@ reconstitution and in vitro CUL4A neddylation with Alexa-488- 
NEDD8. In vitro CRL4©®®N reconstitution and CUL4A neddylation was per- 
formed as described*!*"!, Hiss-~CUL4A-His¢-RBX1 at 3.5}1M was incubated with 
Hiss-DDB1-Hisg—CRBN at 1.5}1M (wild-type or mutant forms) in a reaction mix- 
ture containing 3.8 1M Alexa-488-NEDD8, 50nM NAE1/UBA3 (E1), 150nM 
UBC12 (E2), 1mM ATP, 50 mM Tris pH 7.5, 100mM NaCl, 2.5mM MgCh, 
0.5mM DTT and 5% (v/v) glycerol for 2 h at room temperature. Gel filtration 
purified neddylated CRLASREN (. gCRLAC®N) was concentrated to 2-4 1M, flash 
frozen and stored at —80°C. Alexa-488-NEDD8 labelling efficiency of different 
CRL4°®5N mutant complexes was determined by measuring total fluorescent 
intensity at equal concentration (excitation at 490 nm, emission at 540 nm) using 
a Safire2 microplate reader from Tecan (Extended Data Fig. 4f). 
Biotinylation of CK1o and IKZF1. Purified StrepII-Avi-tagged CK1a or IKZF1 
were biotinylated in vitro at a concentration of 25-50 ,.M by incubation with final 
concentrations of 2.51.M BirA enzyme and 0.2mM D-Biotin in 50 mM HEPES 
pH 7.4, 200 mM NaCl, 10 mM MgCl, 0.25 mM TCEP and 20 mM ATP. The reac- 
tion was incubated for 1 h at room temperature and stored at 4°C for 14-16 h. 
Biotinylated proteins were purified by gel filtration chromatography and stored at 
—80°C (StrepII-Avi-CK1a at ~25 1M, StrepII-Avi-IKZF1 at ~40 1M). 
Time-resolved fluorescence resonance energy transfer (TR-FRET). Increasing 
concentrations of Alexa-488-NEDD8-labelled CRL4C82N (ygCRL4C82N) were 
added to pre-mixed biotinylated IKZF1 at 80nM or CK1a at 100 nM, terbium- 
coupled streptavidin at 4nM (Invitrogen) and IMiDs or 2’-deoxyuridine at 51M 
(final concentrations) in 384-well microplates (Greiner, 784076) in a buffer- 
containing 50mM Tris pH 7.5, 100 mM NaCl, 0.1% pluronic acid and 1% DMSO 
(see also figure legends). Before TR-FRET measurements were conducted, the reac- 
tions were incubated for 15 min at room temperature. After excitation of terbium 
(Tb) fluorescence at 337 nm, emission at 490 nm (Tb) and 520nm (Alexa 488) were 
recorded with a 70 \1s delay to reduce background fluorescence and the reaction 
was followed over 1 h by recording 60 technical replicates of each data point using 
a PHERAstar FS microplate reader (BMG Labtech). The TR-FRET signal of each 
data point was extracted by calculating the 520/490 nm ratio. Data were analysed 
with GraphPad Prism 6 assuming equimolar binding of the probe to the receptor 
(gCRL4C8®9) using the following equations: 

The concentration of the receptor in the bound state, [Cxpouna], can be calculated 
for that setting by the law of mass action: 


[Crbouna] = (Kp + [Cptot] + [Crtot] — ((Kp + [Crtot] + [Crtot])” 
—4x [Cprot] x [Crtot])?°)/2 


Kp is the equilibrium constant for the dissociation, and [Cpror] and [Catot] are the 
total concentrations of the probe and the receptor, respectively. 

The Kp value can be calculated from the change in the fluorescence inten- 
sity, FI, observed by a titration of the receptor at constant probe concentrations 
according to: 


Flops = Flee + (Flpouna = Fliree) x (Kp + [Cptot] + [Crtot =~ ((Kp = [Cptot] 
+ [Crtot])? — 4  [Cptot] X [Crtot])°°)/(2 x [Crtot]) 


Fl,ps is the observed fluorescence intensity, and Flee and Flpouna the fluores- 
cence intensities of the probe in its free and its bound states, respectively. The 
assay window is described by the overall change in the fluorescence intensity, 
(Flbouna = Free). 

Counter titrations with unlabelled proteins were carried out by mixing 
neCRL4@5N at 0.5-1 {1M with 200 nM biotinylated CK1a or 160nM biotiny- 
lated IKZF1 in the presence of 8nM terbium-coupled streptavidin and IMiDs 
at 10-20,1M. After 15 min incubation on ice, increasing amounts of unlabelled 
DDB142?8_CRBN (0.04-40}1M) or IKZF1 (0.04-20)1M; wild-type or mutant forms 
or pre-incubated with consensus or control DNA as stated in the EMSA section) 
were added to the pre-assembled ygCRL4@°8N-CK 1a/IKZF1 complexes in a 1:1 
volume ratio and incubated for 5 min at room temperature. IMiD titrations were 
carried out by premixing ygCRL4°N, CK1a/IKZF1 and 8 nM terbium-coupled 
streptavidin before addition of increasing concentrations of each IMiD (0.005-5 1M) 
in a 1:1 volume ratio (see figure legends for final concentrations). The 520/490 nm 
ratios were plotted to calculate the half maximal effective concentrations (ECs) or 
half maximal inhibitory concentrations (ICs) assuming a single binding site using 
GraphPad Prism 6. ICs9 values were converted to the respective Kj as described*’. 
Three biological replicates were carried out per experiment. The standard deviation 
was derived from the sum of the mean absolute error of 10 technical replicates 
(per data point and replicate) and the standard deviation of the biological replicates. 
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Fluorescence polarization to monitor binding of CRBN mutants to IMiDs. 
Cy5-conjugated thalidomide! (10 nM) was mixed with increasing concentrations 
of either wild-type or mutant forms of purified DDB1-CRBN (0.004-2|1M) in a 
384-well microplate (Greiner, 784076) and incubated for 30 min at room temper- 
ature. CRBN-IMiD interactions were measured in 50 mM Tris pH 7.5, 100mM 
NaCl, 0.1% pluronic acid and 1% DMSO by change in fluorescence polarization 
using a PHERAstar FS microplate reader (BMG Labtech). The Cy5-thalidomide 
bound fraction was calculated as described*’. Data were plotted and analysed using 
GraphPad Prism 6 assuming a single IMiD binding site on CRBN. 

Label-free quantification mass spectrometry. Hct116 cells were purchased from 
The European Collection of Cell Cultures (ECACC, Sigma-Aldrich), immediately 
used for experiments and regularly tested for mycoplasma contamination. Cells 
were cultured in L-lysine and L-arginine free DMEM supplemented with unla- 
belled L-lysine, L-arginine, 10% FBS and 2mM t-glutamine. Cells were grown 
to approximately 50% confluency and the medium was exchanged for DMEM 
supplemented with 13C,!°N-labelled L-lysine (Lys8) and °C,!5N-labelled L-arginine 
(Arg10) containing lenalidomide at 30|1M or equivalent amounts of DMSO as 
control. Cells were incubated for 16 h and harvested for mass spectrometry anal- 
ysis in 0.5 M Tris-HCl pH 8.6, 6 M guanidine hydrochloride, reduced in 16 mM 
TCEP for 30 min, and alkylated in 35 mM iodoacetamide for 30 min in the dark. 
The proteins were digested at 37 °C with lysyl endopeptidase (Wako) after dilution 
to ~2 M guanidine hydrochloride (with 50 mM Tris-HCl pH 7.3, 5mM CaCly 
buffer) for 6 h, and after dilution to <1 M guanidine hydrochloride with trypsin 
(Promega) at 37°C overnight. The resulting peptides were desalted using Cg solid 
state extraction cartridges and offline fractionated into 36 fractions by basic reverse 
phase chromatography. The 36 fractions were recombined to a final of 12 samples. 

Generated peptides were separated on an EASY n-LC 1000 liquid chromatog- 
raphy system equipped with a C}; EASY-Spray column coupled to an Orbitrap 
Fusion mass spectrometer (all from Thermo Scientific). 

Maxquant"! was used for .RAW file processing and controlling peptide and 

protein level false-discovery rates, assembling proteins from peptides, and protein 
quantification from peptides. Peptides were searched against a human Uniprot 
database with both forward and reverse sequences. For analysis, we did not utilize 
the SILAC component of the experiment, but instead used the Maxquant LFQ 
algorithm to quantify the relative abundance of casein kinase isoforms across three 
independent replicates. 
In vitro ubiquitination and identification of target lysines. In vitro ubiquitina- 
tion was performed by mixing wild-type or mutant ygCRL4°®®N at 70nM with 
a reaction mixture containing IMiDs at 350nM or 10\1.M, CK1a at 500nM, E1 
(UBA1, BostonBiochem) at 40nM, E2 (UBCH5a, BostonBiochem) at 1 1M, wild- 
type (201M) or lysine-free (10|1.M) ubiquitin as indicated. Reactions were carried 
out in 50mM Tris pH 7.5, 30mM NaCl, 5mM MgCl2, 0.2mM CaCl, 1mM ATP, 
0.1% Triton X-100 and 0.1 mg ml! BSA, incubated for 15-30 min at 30°C and ana- 
lysed by western blot using anti-CK1« (abcam, ab108296, 1:20,000) and anti-rabbit 
IRDye 800CW antibodies (LI-COR, 926-32211, 1:10,000). Blots were scanned on 
a LI-COR Odyssey infrared imaging system. 

To identify target lysines following in vitro ubiquitination of CK1a in the 
presence or absence of lenalidomide, samples were precipitated with 20% (v/v) 


trichloroacetic acid followed by several acetone washes. The precipitated pro- 
tein was dissolved in 10 \1l of 500 mM Tris pH 8.6, 6 M guanidine hydrochloride 
and 8mM TCEP and incubated with 18 mM iodoacetamide for 30 min at room 
temperature in the dark. After addition of 50 1150 mM Tris pH 7.4, 5mM CaCh, 
samples were incubated with 200 ng trypsin at 37°C for 12-14 h. Mass spec was 
carried out on a FUSION Orbitrap, the data was searched with MASCOT, site 
probabilities were calculated with ASCOR and peak integration for relative quan- 
tification was performed using ProgenesisLC. 

Electrophoretic mobility shift assays (EMSAs). Equimolar ratios of comple- 
mentary DNA single strands at 200 mM were mixed in 10 mM Tris pH 8.0, 50 mM 
NaCl and 1mM MgCh, incubated at 95°C for 2 min and annealed by slowly 
lowing temperature using a thermal cycler. IKZF1 was mixed with duplex DNA 
(consensus DNA: 5’-TCAGAAAAAGGGAATTCCGTCAC-3’; control DNA: 
5'-TCAGACACTTTTGGTACTGTCAC-3’) in 50 mM HEPES pH 7.4, 200 mM 
NaCl, 0.25mM TCEP and 10% (v/v) glycerol and incubated on ice for 15 min 
before the addition of DDB1-CRBN and pomalidomide, as indicated. Binding 
reactions were incubated for 30 min at room temperature, applied to a 4-16% 
NativePAGE Novex Bis-Tris gel (Invitrogen) and separated in 1 x NativePAGE 
buffer at 150 V for 75 min at room temperature. Gels were stained with 
1 pg ml ethidium bromide in 1x NativePAGE buffer followed by Coomassie 
staining. 
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at 100nM and lenalidomide at 101M (blue curve). Addition of buffer to 
nsCRL4C88N_lenalidomide-CK1a (green curve; maximum signal) and 
nsCRL4C88N, CK1a without lenalidomide (red curve; minimum signal). 
ICs) was converted to the respective K; as described in methods. The 
resulting K; is similar to the Kp as determined by non-competitive TR- 
FRET (Fig. la). e, Binding of Cy5-labelled thalidomide to CRBN mutants 
(DDB1-CRBN) determined by fluorescence polarization. Data in this 
figure are biological replicates presented as means + s.d. (n = 3). 


Extended Data Figure 1 | Characterization of the CRL4©°8N_CK1a 
interaction. a, TR-FRET setup. b, Neddylation of CRL4C88N with Alexa 
488-labelled NEDD8. ¢, Titration of lenalidomide, pomalidomide and 
thalidomide (0.0025-2.5 1M) to ysCRL4C®5N at 300 nM and CK1aat 

100 nM. ECs» values show that similar compound concentrations are 
required to bind CK1« to ygCRL4°®®", d, Unlabelled DDB14®?8-CRBN 
competes with ngCRL4C®5N for CK1a binding. Titration of unlabelled 
wild-type DDB145?8_CRBN (0.02-20|1M) to ngCRL4©8%_lenalidomide- 
CK1a complexes pre-assembled by mixing ysCRL4°®"N at 450 nM, CK1a 
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DpB148°8 
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Extended Data Figure 2 | The asymmetric unit contains two quaternary _ crystal contained two quaternary complexes in the unit cell. The two 
complexes. a, DDB143?8_CRBN superimposes on DDB1-CRBN (PDB complexes are shown as cartoon representation coloured according to B 
entry 4TZ4; grey) with an overall r.m.s.d. of 0.96 A. b, CRBN helical factor. Lenalidomide is shown as spheres. Average B-factors for different 


bundle domains (HBD) extracted from a. The CRBN-HBDs superimpose with parts of the molecule are provided. 
an overall r.m.s.d. of 0.44 A. c, The DDB1°??-CRBN-lenalidomide-CK1la 
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Extended Data Figure 3 | CK1a adopts an active kinase conformation. 
a, C-terminal domain of CRBN, lenalidomide and CK1a. Elements 
required for kinase activation are indicated. b, Superposition of CK1a 
with inactive c-Src kinase (PDB entry 2SRC; orange) and an active 
conformation of human lymphocyte kinase Lck (PDB entry 3LCK; grey) 
shown in two different orientations. Conformational differences in the 
activation segment and the aC-helix are indicated. c, Close-up of the 


CK1a C-lobe 


DFG motif 
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Activation 
Segment CKia 
c-Src inactive 
~90° J. (2SRC) 
Lek active 
aC-helix (3LCK) 


CK1a 
Active 


CKia 
Active 


Inactive Inactive 


HRD motif 


aC-helix of CKla. The DFG motif makes hydrophobic contacts with the 
aC-helix. The Lys—Glu hydrogen bond (dashed line) is characteristic of an 
active kinase conformation. d, Superposition of the DFG motifs of CK1a, 
c-Src and Lck. The CK1a DFG motif adopts an active conformation. 

e, Superposition of the HRD motifs of CK1a, c-Src and Lck. The CK1a 
HRD motif adopts an active conformation. 
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Extended Data Figure 4 | CK1o binding induces minor changes in 
CRBN. a, Superposition of DDB1A??8-CRBN-lenalidomide-—CK1la 
(coloured as in Fig. 1c) with human DDB1-CRBN (PDB 4TZ4) in 

grey. Only the CRBN residues His353 and Tyr355 undergo minor 
rearrangements upon CK1a binding. b, Lenalidomide orientation (shown 
as spheres) in the DDB143?8_CRBN-lenalidomide-CK1«a complex. C3 

of the phthalimide group is close to the main chain carbonyl group of 
Glu377 (shown as spheres). c, Orientation of lenalidomide in the structure 
of human DDB1-CRBN (PDB entry 4TZ4; grey) superimposed with 
DDB14*?5_CRBN-lenalidomide-CK 1a. Clashes of phthalimide C6 


and C7 with CK1a are indicated by an arrow. d, e, Thalidomide (d) and 
pomalidomide (e) in the structure of Homo sapiens DDB1 and Gallus 
gallus CRBN (PDB entry 4CI1 and 4C]3) in grey. The 8-strands 10 and 
11 are in an identical conformation as compared to lenalidomide-bound 
human DDB1-CRBN (PDB entry 4TZ4) and are not differentially 
regulated by binding to different IMiDs'*. f, Total intensity measurements 
of different Alexa-488-labelled ygCRL4°®®N mutant complexes confirms 
comparable labelling efficiency (AU, arbitrary units). Data are biological 
replicates presented as means +s.d. (n=3). 
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Extended Data Figure 5 | Conservation of the CRBN-CK 1a interface. Asterisks indicate residues that contact CK1a. Orange dots mark residues 
a, Sequence of the interacting CK1a (-hairpin loop aligned to different required for IMiD binding. Arrows highlight the four diverging residues. 
CK1 and CK2 isoforms. b, Superposition of the CK1a and CK2a (PDB Human CRBN residues C366 and D428 are not part of the CRBN-CK1la 
entry 3U87) B-hairpin loops. CK2a residue Asn62 aligns with CK1la interface. e, Superposition of CRBN-lenalidomide-CK1a (coloured 
residue Gly40 and clashes with lenalidomide. c, Quantification of CK1la as in Fig. 1c) with chicken CRBN (PDB entry 4CI2) in grey. CRBN 

and CK2a cellular abundance using label-free quantification mass Pro352 and CK1a Gly40 make close contacts with the phthalimide ring 
spectrometry. Two-tailed paired t-test analysis revealed significant of lenalidomide. CRBN Val388 is close to CK1a Gly40 (arrow; middle 
differences between CK268, CK2a1/3, CK2a2 and CK1a with P < 0.05. panel). Substitution of Val388 for isoleucine leads to clashes with CKla, 
Data are biological replicates presented as means + s.d. (n = 3). as illustrated by chicken CRBN Ile390 (arrow; right panel). 


d, Sequence alignment of human and mouse CRBN C-terminal domains. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Figure 6 | Characterization of the CRL4°®®N_IKZF1 
interaction. a, ysCRL4C®®S-IKZF1 complex formation measured by TR- 
FRET in the absence (DMSO) or presence of lenalidomide, pomalidomide 
or thalidomide at saturating concentrations (51M). b, Titration of 
lenalidomide, pomalidomide and thalidomide (0.0025-2.5 11M) to 
nsCRL4°R3N at 200 nM and IKZF1 at 80nM. ECs values show that equal 
compound concentrations are required to bind IKZF1 to ysCRL4C®8, 
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c, Titration of unlabelled wild-type DDB14®?®-CRBN (0.02-20 1M) 
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to ngCRL4°®®N_lenalidomide-IKZF1 complexes pre-assembled by 
mixing naCRL4C®2N at 250 nM, IKZF1 at 70 nM and pomalidomide at 
10M. Addition of buffer to ysCRL4°®®S—pomalidomide-IKZF1 (green 
curve; maximum TR-FRET signal) and ngCRL4©®®, IKZF1 without 
pomalidomide (red curve; minimum TR-FRET signal). ICso values were 
converted to the respective Kj as described in methods. Data in this figure 
are biological replicates presented as means + s.d. (n= 3). 
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Extended Data Figure 7 | IKZF1 utilizes a 8-hairpin loop to bind 
CRBN. a, Overview of IKZF1 constructs. ZF, zinc finger; Q, Gln146. 
Hatched lines indicate omitted sequences. b, Sequence alignment between 
human IKZF1-ZF2, the 8-hairpin loop of human CK1qa and Aart (PDB 
2113; residues 133-158). PDB entries 2113, 2LT7 and 1UBD were used 

for IKZF1 modelling. c, Superposition of modelled IKZF1-ZF2 with the 
CK1« 8-hairpin loop. d, Unlabelled wild-type or mutant IKZF1®"196/239-255 
(0.02-10 1M) added to ngCRL4°8®N_pomalidomide-IKZF 17199239755 
complexes pre-assembled by mixing nsCRL4°®"N at 250 nM, 
IKZF1!-196/239-255 at 80 nM and pomalidomide at 51M. ngCRL4C88N— 


[ygCRL4°P®"] in pM 


[yCRL4°"*"] in pM 


pomalidomide-IKZF1 complex (red curve; maximum TR-FRET signal). 
nsCRL4°85N, IKZF1 without pomalidomide (blue curve; minimum 
TR-FRET signal). e, CBRN-CK1a interface. f, Model of the CRBN-IKZF1 
interface. CRBN G1n390 is proximal to IKZF1 His163 in the a-helix of 
ZF2. g, Mutation of CK1a Glu41 to alanine, the corresponding residue in 
IKZF1, strengthens binding (Kp = 109 + 30 nM). h, Mutation of CRBN 
G1n390 to arginine has no substantial influence on CK1a binding, but 
interferes with IKZF1 recruitment (i) as predicted by the model. 

TR-FRET data in this figure are biological replicates presented as 

means +s.d. (n=3). 
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Extended Data Figure 8 | Presence of IKZF1 consensus DNA interferes 
with binding to CRBN. a, Model of IKZF1 bound to DNA and CRBN. 
DNA clashes with the zinc-binding region of CRBN. b, Electrophoretic 
mobility shift assay. IKZF159-19°5°-> specifically binds its consensus 
DNA (cons. DNA). ¢, Electrophoretic mobility shift assay. Presence of 
DNA affects the interaction between DDB1-CRBN, pomalidomide and 


IKZF1. DNA does not comigrate with the DDB1-CRBN-IKZF1 complex. 
DDB1-CRBN and IKZF1 do not interact in the absence of pomalidomide. 


d, IKZF1 pre-incubated with equimolar concentrations of DNA shows 


reduced binding to ysCRL4C88N in TR-FRET. Titration of unlabelled 
IKZF183-199/239-255 (992-10 41M) pre-incubated with control, consensus or 
no DNA. Unlabelled IKZF1 was added to ysCRL4°82N_pomalidomide- 
IKZF1 complexes pre-assembled by mixing ysCRL4°®8 at 250 nM, 
IKZF11-196/239-255 at 80 nM and pomalidomide at 51M. Addition of buffer 
to nsCRL4C®®N_pomalidomide-IKZF1 (red curve; maximum TR-FRET 
signal) and nsCRL4C88N, IKZF1 without pomalidomide (blue curve; 
minimum TR-FRET signal). Data are biological replicates presented as 
means +s.d. (n= 3). 
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Extended Data Figure 9 | In vitro ubiquitination of CK1a and 
identification of target lysines. a, In vitro ubiquitination of CK1a by 
ngCRL4C®5N in the presence different IMiDs at 101M ([IMiD] >> Kp). 
b, Model of CRL4“*®" bound to CK1a. Ubiquitin target lysines of CK1a 
are shown as spheres and represent sites identified by mass spectrometry 
following in vitro (yellow) or in vivo® (blue) ubiquitination. Lys62, 65 


and 179 are close to the CUL4 orientation, whereas Lys8 and 225 can be 
reached in CUL4’ orientation. c, TR-FRET of ngCRL4C88N-CK1la complex 
formation in the presence of lenalidomide or 2'-deoxyuridine at 51M. 
2'-deoxyuridine does not mediate interaction between ngCRL4°R"N and 
CK1a. Data are biological replicates presented as means + s.d. (n= 3). 
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Extended Data Table 1 | Data collection and refinement statistics 
DDB1*"??-CRBN-lenalidomide-CK 1a 


Data collection 


Space group Pl 
Cell dimensions 
a, b,c (A) 87.97, 109.93, 112.40 
a, B, y (°) 106.02, 93.19, 101.63 
Resolution (A) 65—2.45 (2.58—2.45) * 
Rirerge (%) 7.8 (158) 
CCy/2 (%) 99.8 (44.7) 
Tol 8.7 (0.8) 
Completeness (%) 97.3 (96.5) 
Redundancy 3.6 (3.7) 
Refinement 
Resolution (A) 65—2.45 
No. reflections 140745 
R york: Rive 18.07 / 21.02 
No. atoms 
Protein 23199 
Ligand/ion 40 
Water 305 
B-factors 
Protein 97.1 
Ligand/ion 78.3 
Water 70.2 
R.m.s deviations 
Bond lengths (A) 0.01 


Bond angles (°) 


1.18 


*Highest resolution shell is shown in parenthesis. 
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Structural basis of cohesin cleavage by separase 


Zhonghui Lin!’, Xuelian Luo** & Hongtao Yu! 


Accurate chromosome segregation requires timely dissolution of 
chromosome cohesion after chromosomes are properly attached 
to the mitotic spindle. Separase is absolutely essential for cohesion 
dissolution in organisms from yeast to man! ”. It cleaves the 
Kleisin subunit of cohesin and opens the cohesin ring to allow 
chromosome segregation. Cohesin cleavage is spatiotemporally 
controlled by separase-associated regulatory proteins, including 
the inhibitory chaperone securin*, and by phosphorylation of 
both the enzyme and substrates”~!*. Dysregulation of this process 
causes chromosome missegregation and aneuploidy, contributing 
to cancer and birth defects. Despite its essential functions, 
atomic structures of separase have not been determined. Here 
we report crystal structures of the separase protease domain 
from the thermophilic fungus Chaetomium thermophilum, alone 
or covalently bound to unphosphorylated and phosphorylated 
inhibitory peptides derived from a cohesin cleavage site. These 
structures reveal how separase recognizes cohesin and how cohesin 


phosphorylation by polo-like kinase 1 (PIk1) enhances cleavage. 
Consistent with a previous cellular study'’, mutating two securin 
residues in a conserved motif that partly matches the separase 
cleavage consensus converts securin from a separase inhibitor to 
a substrate. Our study establishes atomic mechanisms of substrate 
cleavage by separase and suggests competitive inhibition by securin. 

Separase belongs to the clan CD family of cysteine proteases which 
includes caspases'. It contains a large amino (N)-terminal armadillo 
(ARM) repeat domain and a highly conserved carboxy (C)-terminal 
separase protease domain (SPD) that consists of a pseudo-protease 
domain (PPD) and an active protease domain (APD)!* (Fig. la and 
Extended Data Fig. 1). Cohesin forms an asymmetric ring to top- 
ologically entrap chromosomes (Fig. 1a)!>!®. Separase cleaves the 
kleisin subunit to open the cohesin ring and trigger chromosome 
segregation. It also cleaves other substrates to regulate anaphase 
spindle elongation and centriole duplication'”'*. High-resolution 
structures of separase have not been determined more than a decade 
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Figure 1 | Structure of ctSPD. a, Domains and motifs of separase from 
C. thermophilum (top) and schematic drawing of cohesin (bottom). 

b, Sequence alignment of the cleavage sites of separase substrates; 

sc, Saccharomyces cerevisiae; sp, Schizosaccharomyces pombe; xl, Xenopus 
laevis; hs, Homo sapiens. c, Autoradiograph of the ctSPD cleavage 

assay with *°S-ctSccl1 wild type (WT) or non-cleavable mutant (NC) as 


Inactive 


Caspase 9 


substrates. For gel source data, see Supplementary Fig. 1. d, Cartoon of the 
crystal structure of ctSPD. L4 is coloured magenta. Loops with no visible 
electron densities are indicated by dashed lines. e, Cartoon of caspase 9 
(Protein Data Bank accession number 1JXQ), with the bound inhibitor 
shown as yellow sticks. 
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Figure 2 | Contributions of the L4 loop and the helical insert to the 
activity of ctSPD. a, Cartoon of ctSPD with the catalytic dyad and L4 
loop residues shown. b, d, Quantification of the protease activity of ctSPD 
WT and mutants (mean + s.d., n= 3 independent experiments). Mutants 
with activities greater or less than twofold that of WT are in blue and red, 
respectively. c, Interactions between the N-terminal tag and a surface 
pocket of ctSPD. e, Autoradiograph of the *°$-ctSccl cleavage assay by 
ctSPD WT or mutants. Bottom: Coomassie-stained gel of ctSPD proteins. 
f, A conserved basic pocket in ctSPD, with the 2F, — F, map of the bound 
citrate shown at 2.00. 


since its discovery, hindering our understanding of its mechanism 
and regulation. 

We found that SPD of C. thermophilum (ct) separase could be 
expressed in large quantities in bacteria without securin (Extended 
Data Fig. 2a). Recombinant ctSPD, but not the C2110S mutant, 
cleaved ctSccl to produce two major fragments (Extended Data 
Fig. 2b). Separase is known to cleave after the EXXR (X, any residue) 
consensus motif”. Charge-reversal mutation of the 7!7EVGR?!* motif 
in ctSccl reduced cleavage by separase (Fig. 1b, c). An acyloxymethyl 
ketone (AMK)-containing peptide inhibitor derived from this cleavage 
site blocked ctSccl cleavage in a dose-dependent manner (Extended 
Data Fig. 2c, d), and retarded the gel mobility of ctSPD™’, but not 
ctSPD@!!S, consistent with covalent inhibition (Extended Data 
Fig. 2e). Similar to separases from other species®, longer constructs of 
ctSPD containing an N-terminal extension underwent autocleavage 
at the '*8ELAR!™® site (Fig. 1b and Extended Data Fig. 2f). Thus, 
recombinant ctSPD was active. 

We determined the crystal structure of ctSPD (Fig. 1d and Extended 
Data Table 1). It forms one globular domain with two sub-domains— 
the PPD and the APD—that pack against each other. APD has an 
overall fold similar to that of caspases (Fig. 1d, e and Extended Data 
Figs 3 and 4a). PPD also has a mixed «/f fold, but its central 3-sheet 
has a topology different from that of caspases. One edge of this central 
sheet of PPD forms an edge-on interaction with that of APD, whereas 
the other edge is capped by a helical domain in PPD. A prominent 
helical insert of PPD forms a long coiled-coil and packs against APD. 

The catalytic dyad H2083 and C2110 are located in loops L3 and L4 
of APD (Fig. 1d and Extended Data Fig. 3a). An important mechanism 
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Figure 3 | Structural basis of Sccl cleavage by separase. a, Cartoon of 
ctSPD bound to pAMK (shown as sticks overlaid with the 2F, — F. map 

at 1.00). b, Cross-sectional view of the surface drawing of ctSPD-ctSccl 
coloured with its electrostatic potential (blue, positive; red, negative; white, 
neutral). The bound Sccl peptide is shown as sticks. c, The $1 pocket of 
ctSPD that recognizes the P1 arginine. Dashed lines indicate hydrogen 
bonds or favourable electrostatic interactions. d, Autoradiograph of the 
cleavage reaction of cSPD WT and mutants with *°S-ctSccl as substrate. 
Active-site/S1 mutants are labelled red; $4 mutants are labelled blue. 
Bottom: Coomassie-stained gel of the ctSPD proteins. Asterisk marks 

an aberrant cleavage product of D2151A. e, The pS210-binding site. 

f, Autoradiograph of the cleavage reactions of ctSPD WT and mutants with 
35§-ctSccl as substrate, with or without a prior incubation with hsPlk1. 


of pro-caspase activation is the reorganization of L4, which can be 
achieved through homo-dimerization, cleavage of an internal linker, or 
both'?-**, The geometry of the catalytic dyad and the extended confor- 
mation of L4 in ctSPD are similar to those in active caspase 9 (Fig. 1d, e), 
consistent with ctSPD being an active enzyme. Thus, separase acti- 
vation does not require proteolytic cleavage of L4. Consistent with 
the importance of the L4 loop, mutations of two residues adjacent to 
C2110, M2108 and $2112, reduced the activity of cfSPD (Fig. 2a, b and 
Extended Data Fig. 4b). In contrast, mutations of L4 residues distal to 
C2110, including E2120 and F2121, enhanced the activity of ctSPD. 

A segment of the N-terminal tag of recombinant ctSPD binds to 
a conserved surface pocket in PPD adjacent to L4 (Figs 1d, 2c and 
Extended Data Fig. 4c). Although this tag is not required for the 
activity of cfSPD, mutations targeting residues in the tag-binding 
pocket altered the activity of ctSPD containing the tag (Fig. 2d and 
Extended Data Fig. 4d). Similar to mutations of the distal L4 residues, 
the D1698K and D1960K mutations enhanced the activity of ctSPD. 
We propose that securin or other regions of separase may bind to this 
tag-binding site, alter the conformation of L4, and affect the protease 
activity of separase. Even without bona fide ligands, binding of an 
artificial tag to this site can regulate the protease activity of ctSPD ina 
subtle way. 

Unlike active caspase 9, which forms a homodimer”, separase 
contains an internal PPD in the same polypeptide chain that packs 
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Figure 4 | Securin as a pseudo-substrate of separase. a, Sequence 
alignment of the EVE motif of securin, with the separase cleavage 
consensus shown above. The KEN and destruction boxes (D) are 
indicated. ®/(, hydrophobic/hydrophilic residues. b, Autoradiograph 
of the cleavage reactions of *°S-ctsecurin WT or P164R/P165D (RD) by 
ctSPD WT or C2110S with or without the AMK inhibitor. 


against and stabilizes its APD. In particular, the helical insert of PPD 
makes extensive contacts with APD and bridges the two sub-domains 
(Extended Data Fig. 5a, b). Deletion of the helical insert or mutations 
of key residues at the helical insert-APD interface, including D1805 
and W2143, abolished the expression of soluble ctSPD in bacteria 
(Fig. 2e and Extended Data Fig. 5a—c). Several helical-insert resi- 
dues, including C1782 and H1783, are located close to the active site 
(Extended Data Fig. 5a). Mutations of these residues did not affect 
the solubility of ctSPD, but reduced the protease activity (Extended 
Data Fig. 5d). Moreover, residues from the tip of the helical insert, 
along with residues from APD, form a basic pocket that binds a cit- 
rate molecule (Fig. 2f). Mutations of these conserved residues, with 
the exception of R1794E, diminished separase activity (Fig. 2e and 
Extended Data Fig. 1). Therefore, the helical insert is critical for both 
the structure integrity and activity of separase. 

Phosphorylation of Scc1 by Plk1 enhances Scc1 cleavage by sepa- 
rase’®!! This cleavage-enhancing phosphorylation is opposed by the 
shugoshin-PP2A complex bound to cohesin**?”, Incubation of ctSccl, 
but not ctSccl $210A, with human Plk1 (hsPlk1) enhanced the cleav- 
age of ctSccl by ctSPD (Extended Data Fig. 6a). Addition of the hsPlk1 
inhibitor BI2536 blocked this enhancement. The phospho-mimick- 
ing $210E mutation stimulated ctSccl cleavage by separase (Extended 
Data Fig. 6b). Thus, Plk1-dependent phosphorylation of ctScc1 at $210 
enhances Sccl cleavage by separase. 

We next determined the crystal structures of ctSPD bound to 
unphosphorylated (AMK) or phospho-S210-containing (pPAMK) 
inhibitors (Extended Data Table 1). The overall structure of ctSPD- 
AMK and ctSPD-pAMK complexes is virtually identical to that of free 
ctSPD, indicating that substrate binding does not induce notable con- 
formational changes. Only the C-terminal *’7EVGR?!* segment of the 
unphosphorylated AMK inhibitor was visible (Extended Data Fig. 6c), 
whereas all residues of the pAMK inhibitor had clearly defined elec- 
tron density (Fig. 3a). In both structures, the active-site cysteine C2110 
is covalently linked to ctScc1 R215 at the P1 position. Consistent with 
its role in stabilizing the oxyanion during catalysis, ctSPD H2083 of the 
catalytic dyad is located close to carbonyl group of ctSccl R215. R215 
forms a salt bridge with ctSPD D2151 at the base of a deep, acidic S1 
pocket (Fig. 3b, c). ctSccl E212 at P4 inserts into the aforementioned 
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c, Autoradiograph of the cleavage reactions of *°S-ctsecurin®” by the 
indicated ctSPD proteins. d, Model depicting specificity determinants, 
phospho-regulation, and securin inhibition of separase-dependent 
cohesin cleavage. SA, stromal antigen. Cohesin cleavage by separase can be 
stimulated by DNA’. The ARM domain of separase might contact DNA. 


citrate-binding pocket, forming favourable electrostatic and hydrogen 
bonding interactions (Fig. 3b and Extended Data Fig. 6d). ctSccl V213 
and G214 form minimal contacts with ctSPD. Mutations of residues 
lining the S1 and S4 pockets in ctSPD greatly diminished separase 
activity (Fig. 3d). 

The S1 pocket mutant D2151A of ctSPD cleaved ctSccl at a differ- 
ent site (Fig. 3d). The ctSccl E180K mutation abolished this aberrant 
cleavage (Extended Data Fig. 6e), indicating that the mutant separase 
cleaved the '8°ELGM'® site. Thus, D2151 not only selects for basic 
residues but also discriminates against hydrophobic residues at P1. 
ctSPD charge reversal mutants D2151R and R2152E did not efficiently 
cleave the complementary charge reversal mutants of cfSccl (Extended 
Data Fig. 6f), indicating that other residues in the S1 and $4 pockets 
contribute to substrate recognition. Because most residues lining 
the S1 and S4 pockets are conserved among separases in all species 
(Extended Data Figs 1 and 7), our analyses establish the basis for the 
EXXR substrate specificity of separase. 

Phosphorylation of the substrate does not alter the binding mode of 
EVGRat P1-P4, but reveals or establishes additional contacts at P5 and P6. 
1211 at the P5 position packs against W1797 of the helical insert (Fig. 3e). 
ctSccl 1211A was less efficiently cleaved by ctSPD with or without Plk1 
(Extended Data Fig. 8a). Thus, as reported previously”, the hydrophobic 
residue at P5 contributes to substrate specificity. Phospho-S210 at P6 
makes favourable electrostatic interactions with R1794 of the helical 
insert and R2148 of APD in ctSPD (Fig. 3e and Extended Data Fig. 8b). 
Single mutation of R1794 or R2148 reduced the stimulation of Scc1 cleav- 
age by Plk1, whereas the double mutation abolished the effect (Fig. 3f 
and Extended Data Fig. 8c, d). Unlike R2148A, R1794E does not affect 
the cleavage of unphosphorylated Scc1. Therefore, R1794 specifically 
serves as a receptor for pS210. R2148 contributes to the recognition of 
both pS210 and E212. The serine at P6 is conserved in fungal Sccl and 
other separase substrates (Fig. 1b). The N-terminal separase cleavage site 
in vertebrate Sccl contains a phospho-mimicking, acidic residue at that 
position. Our structures thus explain the phosphorylation dependency 
of cohesin cleavage, and further suggest that this phospho-regulation 
might apply to other separase substrates. 

Finally, we probed the mechanism by which securin inhibits sep- 
arase. Securin blocks substrate access to the active site of separase*”®. 
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Expectedly, the ctsecurin-ctseparase complex was less active in ctScc1 
cleavage, compared with ctSPD (Extended Data Fig. 9a, b). A con- 
served EVE motif in securin matches the separase cleavage consensus 
at positions P2—P6, but lacks the arginine at P1 and often has a pro- 
line at PO instead of a hydrophilic residue (Fig. 4a). A securin mutant 
with three residues in this motif mutated was cleaved by separase in 
fission yeast cells’. We thus mutated P164 and P165 in ctsecurin to 
Rand D, the matching cfSccl residues at P1 and PO. The resulting 
ctsecurin®” mutant was efficiently cleaved by ctSPD, and this cleavage 
was inhibited by the AMK inhibitor (Fig. 4b). Mutating the phospho- 
serine-binding residues in ctSPD or E159 in ctsecurin®? reduced cleav- 
age (Fig. 4c and Extended Data Fig. 9c), indicating that this artificial 
substrate bound at the canonical substrate-binding sites of separase. 
ctsecurin bound tightly to the N-terminal ARM domain of ctseparase 
(Extended Data Fig. 9d). A synthetic EVE-containing securin peptide 
did not inhibit ctSPD (Extended Data Fig. 9e). We propose that securin 
acts as a pseudo-substrate to competitively block substrate binding to 
separase (Fig. 4d). Securin binding to the ARM domain of separase 
provides the necessary avidity for securin to outcompete authentic 
substrates for access to the active site. Securin is not cleaved because 
of incompatible residues at the site of cleavage. 

As a crucial protease that triggers chromosome segregation, sepa- 
rase is a potential oncoprotein”. Because of the conserved principles of 
substrate recognition (Extended Data Fig. 7), our structure of an active 
fungal separase can guide the rational design of chemical inhibitors of 
human separase, which may have therapeutic potential. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Expression and purification of ctSPD. The ctseparase cDNA (GenBank 
identity 18261092) was synthesized at GenScript USA. For the expression 
of the ctSPD, the cDNA fragment of ctSPD1°*?-???3 was subcloned into a 
modified pET bacterial expression vector. The pET-ctSPD vector encoded 
ctSPD'°??-2223 with an N-terminal Hiss tag of the following sequence: 
MGSSHHHHHHSQLEVLFQGPLGSGRP. The pET-ctSPD vector was transformed 
into Escherichia coli strain BL21(DE3). Protein expression was induced with iso- 
propylthiogalactoside (IPTG) at 18°C overnight. The bacteria were harvested and 
resuspended in the lysis buffer (50 mM Tris-HCl, pH 8.0, 200 mM NaCl, 5% glyc- 
erol, 1mM DTT, and 0.05% Triton X-100). After sonication and centrifugation, 
the supernatant was applied to Ni**-NTA resin (Qiagen). After extensive washing, 
His¢-ctSPD was eluted from the Ni?+-NTA column. Hisg-ctSPD was further 
purified with a mono Q 5/50 GL anion-exchange column (GE Healthcare) and a 
Superdex 200 10/300 GL column. The point mutants of ctSPD were generated with 
a QuikChange Lightning Site-Directed Mutagenesis kit (Agilent Technologies). 
The truncated variants and point mutants of ctSPD were expressed and purified 
similarly. Because ctSPD'™*-?*?3 underwent autocleavage at the '*ELAR'° 
site, we generated a non-cleavable 168RLAE'© mutant to prevent autocleavage 
and increase yield. All ctSPDs in this study, except that in Extended Data Fig. 2f, 
contained the non-cleavable mutation. 

The selenomethionine (SeMet)-labelled ctSPD1°-*”"3 was produced with the 
methionine biosynthesis inhibition method*!. Briefly, bacteria transformed with 
pET-ctSPD cultured overnight were pelleted, washed, and resuspended with M9 
minimal media. The bacteria were further incubated at 37°C until the absorbance 
at 600 nm reached about 1.0. Methionine biosynthesis was inhibited by the addition 
of the amino-acid solution containing 50 mg! of Leu/Ile/Val and 100 mg1~! of 
Phe/Lys/Thr/SeMet. Protein expression was induced with 0.4mM ITPG at 18°C 
overnight. The SeMet-labelled protein was subsequently purified through the same 
procedure as described above. 

Expression and purification of ctseparase-securin complex and 
ctSPD'5°!2223. The ctsecurin cDNA (GenBank identity 18256826) was cloned from 
a cDNA library of C. thermophilum. The cDNAs of ctseparase and ctsecurin were 
separately subcloned into a modified pFastBac HT vector (Invitrogen). The final 
constructs encoded an N-terminal His¢-Strep-tagged ctseparase and an N-terminal 
His¢-tagged ctsecurin. Baculoviruses of ctseparase (full-length or residues 1-1500) 
and ctsecurin were constructed with the Bac-to-Bac system (Invitrogen) according 
to the manufacturer’s protocols. Sf9 cells were co-infected with ctseparase and 
ctsecurin baculoviruses and harvested at 48h after infection. Cells were resus- 
pended in the lysis buffer containing 50 mM Tris-HCl, pH 8.0, 200 mM NaCl, 
5% glycerol, 1mM DTT, and 0.05% Triton X-100, followed by sonication and 
centrifugation. The supernatant was applied onto a Strep-Tactin Superflow column 
(Qiagen). After extensive washes with the lysis buffer, the ctseparase-securin com- 
plex was eluted with the elution buffer containing 5 mM d-Desthiobiotin (Sigma- 
Aldrich), 50 mM Tris-HCl, pH 8.0, 200 mM NaCl, 5% glycerol, and 1mM DTT. 
The Hisg-ctSPD'**!-”3 protein was expressed in Sf9 cells with a similar strategy 
and purified through a Ni?*-NTA column. 

Separase activity assay. The ctSccl cDNA (GenBank identity 18259702) was syn- 
thesized by GenScript USA and was cloned into a modified pCS2 vector with a SP6 
promoter. To produce *°S-ctSccl or its mutants, the pCS2-ctSccl plasmids were 
added to a TNT Quick Coupled Transcription Translation System (Promega) and 
incubated in the presence of *S-methionine at 30°C for 90 min. (The *°S-ctSccl 
proteins migrated as a doublet on SDS-polyacrylamide gel electrophoresis (SDS- 
PAGE), possibly owing to proteolysis or internal methionine initiation during the 
in vitro translation reaction.) Then, 211 of *°S-ctSccl was added to 18 il of ctSPD 
(~1.5 1M) or ctseparase-securin protein solution containing 25 mM HEPES (pH 
7.5), 75mM KCl, 5mM MgCh, 1mM DTT, 15mM Nak, 1mM EGTA, 10% glycerol, 
and 0.05% Triton X-100, and incubated at 30°C for 60 min. For assays in Fig. 1c 
and Extended Data Fig. 2b, a higher concentration of ctSPD (3.0|1M) was used, 
resulting in more complete ctScc1 cleavage. For AMK inhibition assay, ctSPD was 
pre-incubated with the cfSccl-AMK inhibitor (synthesized by KareBay Biochem) 
at room temperature (25°C) for 30 min and further incubated with *°S-ctSccl at 
30°C for 60 min. In the PIk1 stimulation assay, *°S-ctSccl was pre-treated with 
recombinant GST-hsPlk1!?!"P in the kinase buffer for 30 min at 30°C, in the 
absence or presence of 10j1M B12536, and further incubated with ctSPD (0.541M) 
for 60 min. The reaction mixtures were separated on SDS-PAGE gels, which were 
stained, destained, dried, and analysed with Fuji or GE phosphoimagers. 
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The ctsecurin WT and P164R/P165D (ctsecurin®?) cDNAs were subcloned 
into pCS2-Myc vector with an SP6 promoter. The *°S-ctsecurin? and ctsecurin®” 
proteins were produced and assayed as described above. The ctsecurins3-177 (DPLQ 
VEEVEYAPPKPKEMPYESDVE) and ctsecurin}s3_177 3A (DPLQVEAAAYAPP 
KPKEMPYESDVF) peptides were chemically synthesized and tested for their 
ability to inhibit the cleavage of ctSccl by ctSPD as described above. 
Crystallization and data collection. All crystallization experiments were per- 
formed at 20°C. Initial screens were performed with a Phoenix crystallization 
robot (Art Robbins Instruments), using the commercially available screening 
kits from Hampton Research, Qiagen, and Molecular Dimensions. Conditions 
obtained from the initial screens were optimized using hanging-drop vapour 
diffusion method. Diffraction-quality crystals were obtained by repeated micro- 
seeding. All crystals were cryoprotected with a reservoir solution supplemented 
with 15% glycerol. 

Both native and SeMet-labelled ctSPD'*?-***? crystals were grown by mixing 
equal volumes of the protein solution (11 mgml~') with the precipitant solution 
containing 0.2 M ammonium citrate tribasic (pH 7.0), 20% PEG3350, and 10mM 
DTT. Diffraction data were collected at beamline BL8.2.1 at the Advanced Light 
Source (Lawrence Berkeley National Laboratory) at the wavelength of 0.9786 A at 
100 K and processed with HKL3000 (ref. 32). 

For crystallization of ctSPD1°°?-???_AMK and ctSPD1©?-?23_pAMK com- 

plexes, the purified ctSPD proteins were mixed with the ctSccl-AMK or phos- 
pho-ctSccl-AMK peptide inhibitors (KareBay Biochem) at a molar ratio of 1:2.5, 
and incubated overnight at room temperature to form covalent complexes as 
monitored by SDS-PAGE. The complexes were further purified with a Superdex 
200 10/300 GL size-exclusion column in the buffer containing 20mM Tris-HCl (pH 
8.0), 200 mM NaCl, and 5mM DTT. Crystals of ctSPD!?-?223_ A MK were grown by 
mixing equal volumes of the protein solution (13 mg ml’) with the precipitant solu- 
tion containing 0.1 M ammonium citrate tribasic (pH 7.0) and 12% PEG3350. For 
the crystallization of ctSPD1?-??73-pAMK complex, the 11 mgmI protein solu- 
tion was mixed with an equal volume of the precipitant solution containing 0.2 M 
KCl, 50 mM HEPES (pH 7.5), 32% pentaerythritol propoxylate (5/4 PO/OH), and 
10 mM DTT. Diffraction data for cfSPD1®?-???3__A MK and ctSPD1©3-?223_p AMK 
were collected at beamline 19-ID (SBC-CAT) at the Advanced Photon Source 
(Argonne National Laboratory) at 100K at wavelengths of 0.9793 A and 0.9795 A, 
respectively, and processed with HKL3000. 
Structure determination and refinement. The crystal of SeMet-labelled 
ctSPD1°>-2223 diffracted to a minimum Bragg spacing of 2.20 A and exhibited the 
symmetry of space group P2,22) with cell dimensions of a=55.67 A, b=98.79 A, 
c= 107.76 A. Phases were obtained from the selenium single-wavelength anoma- 
lous diffraction method. With data truncated to 2.5 A, nine of ten possible selenium 
sites were located and refined with PHENIX AutoSol*?, resulting in an overall 
figure of merit of 0.323. The experimental electron density map was used to con- 
struct an initial model with automated building with PHENIX AutoBuild. Asa 
result, 414 of total 587 residues were built in the initial model, with Ryo. and 
Ryree of 27.74% and 32.79%, respectively. Iterative model building and refine- 
ment were performed with JLigand*4, COOT*S, and PHENIX. Phases of native 
ctSPD1063-2223, c¢SP)1632-2223_ A MK, and ctSPD!©°3-??23_pAMK were obtained 
by molecular replacement with Phaser using the SeMet crystal structure as the 
search model. Data collection and structure refinement statistics are summarized 
in Extended Data Table 1. Ramachandran statistics (favoured/allowed/outlier 
(%)) calculated by MolProbity*® for ctSPD1°°-??73, ctSPD1692-?223_ A MK, and 
ctSPD'?-7?23_b A MK were 98.0/1.6/0.4, 98.1/1.9/0.0, and 98.1/1.7/0.2, respec- 
tively. All structural figures were generated with the program PyMOL (http://www. 
pymol.org/) using the same colour and labelling schemes. 
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%& Catalytic dyad @ Site 1 residues that bind P1 arginine = @ Site 4 residues that bind citrate and P4 aspartate 
@ Site 6 residues that bind P6 phospho-serine 


Extended Data Figure 1 | Sequence alignment of the SPDs from (PPD, blue; APD, green; the helical insert in PPD, cyan). Abbreviations: 
multiple species. The alignment is generated using the online ESPript ct, Chaetomium thermophilum; sc, Saccharomyces cerevisiae; sp, 
2.0 server. Secondary structural elements of ctSPD are indicated above Schizosaccharomyces pombe; xt, Xenopus tropicalis; hs, Homo sapiens. 


the sequences, with the same labelling and colour schemes as in Fig. 1d 
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Extended Data Figure 2 | Purification, activity, inhibition, and 
autocleavage of active ctSPD. a, Coomassie-stained gel of purified 
recombinant ctSPD wild type (WT) and C2110S. b, Autoradiograph 

of the ctSPD cleavage assay with *°S-ctSccl as substrate. c, Chemical 
structure of the acyloxymethyl ketone (AMK) inhibitor derived from the 
ctSccl cleavage site. d, Autoradiograph of the ctSPD cleavage assay with 
35§-ctSccl as substrate, in the absence or presence of increasing doses of 
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the AMK inhibitor depicted in c. e, Coomassie-stained SDS-PAGE gel 

of purified recombinant ctSPD WT or C2110S treated with the indicated 
doses of the ctSccl-AMK peptide inhibitor. The positions of unmodified 
ctSPD and ctSPD-inhibitor conjugates are indicated. f, Coomassie-stained 
gel of recombinant ctSPD'**?-**?3 WT or non-cleavable (NC) mutant. 

The ctSPDN© mutant contains the E1643R and R1646E mutations. 

The positions of intact and autocleaved ctSPD proteins are indicated. 
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ctSPD hsCaspase 9 dimer 


Extended Data Figure 3 | Comparison between the folding topologies of ctSPD and the caspase 9 dimer (Protein Data Bank accession number 
1JXQ). The labelling and colour schemes are the same as in Fig. 1d, e. H, the catalytic histidine; C, the catalytic cysteine. 
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Extended Data Figure 4 | Contributions of the L4 loop and a surface 
pocket to the protease activity of ctSPD. a, Cartoon of the crystal 
structure of ctSPD, with the PPD coloured blue, the APD coloured green, 
the helical insert in PPD coloured cyan, and an N-terminal tag peptide 
coloured yellow. The N and C termini are indicated. All secondary 
structure elements are labelled. Loops with no visible electron densities 
are indicated by dashed lines. Loop 4 (L4) is coloured magenta. H2083 
and C2110 of the catalytic dyad are shown as sticks. The orientation of 
ctSPD in this figure is related to that in Fig. 1d by a 180° rotation along the 
vertical axis. c, Representative autoradiograph of the *°S-ctScc1 cleavage 
assay by WT ctSPD or the indicated mutants. Bottom: Coomassie-stained 
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gel of ctSPD proteins used in the assay. Quantification of the relative 
protease activities of ctSPD WT and mutants is shown in Fig. 2b. 

The protease activity is defined as the ratio between intensities of the 

two major ctSccl cleavage products and that of the uncleaved ctSccl. 

c, Cartoon of the crystal structure of ctSPD, in the same orientation as in 
Fig. 1d. The tag peptide (HSQLEVLFQGP) is shown as sticks, overlaid 
with its 2F, — F, electron density map contoured at 1.00. d, Representative 
autoradiograph of the *°S-ctScc1 cleavage assay by WT ctSPD or the 
indicated mutants. Bottom: Coomassie-stained gel of ctSPD proteins used 
in the assay. Quantification of the relative protease activities of ctSPD WT 
and mutants is shown in Fig. 2d. 
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Extended Data Figure 5 | Interactions between the helical insert and (+) isopropyl 3-p-1-thiogalactopyranoside (IPTG) and eluates from 
the APD. a, b, Zoomed-in views of cartoons of ctSPD in two orientations Ni?+-NTA beads that had been incubated with the IPTG lysates. 
that are related by a 180° rotation along the vertical axis. Residues at the d, Autoradiograph of the *°S-ctScc1 cleavage assay by WT ctSPD or the 
interface between the helical insert of the PPD and the APD are shown indicated mutants. Bottom: Coomassie-stained gel of ctSPD proteins used 
in sticks and labelled. c, Coomassie-stained gel of lysates of bacteria in the assay. 


expressing the indicated ctSPD mutants and treated without (—) or with 
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Extended Data Figure 6 | Phospho-regulation and specificity 
determinants of separase-mediated cohesin cleavage. a, Autoradiograph 
of the ctSPD cleavage reactions of *°S-ctSccl WT or $2104, treated with 
or without human (hs) Plk1 or its inhibitor BI2536. b, Autoradiograph 
of the ctSPD cleavage reactions, with *°S-ctSccl WT or the phospho- 
mimicking $210E as substrates. c, Zoomed-in view of the cartoon of 
ctSPD bound covalently to the ctSccl-AMK inhibitor. The catalytic dyad 
residues C2110 and H2083 are shown as red sticks. The covalently bound 
inhibitor is shown as yellow sticks, overlaid with its 2F, — F- electron 
density map contoured at 1.00. d, Zoomed-in view of the S4 pocket of 
ctSPD that recognizes the P4 glutamate. Dashed lines indicate hydrogen 
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bonds or favourable electrostatic interactions. The orange sphere indicates 
a water molecule. e, Mapping of the aberrant ctSccl1 cleavage site by ctSPD 
D2151A. Top: sequence alignment of the aberrant site of D2151A and 

the major site of WT. Bottom: autoradiograph of the cleavage reactions 

of cfSPD™ or ctSPD??!5!4 with the indicated °S-ctScc1 proteins as 
substrates. Asterisk marks the aberrant cleavage product by ctSPD??!9!4. 
f, Charge reversal mutants of ctSPD fail to cleave complementary charge 
reversal mutants of ctSccl. Autoradiograph of the cleavage assay of 

WT ctSPD or the indicated mutants, with *°S-ctSccl WT or mutants as 
substrates. Bottom: Coomassie-stained gel of ctSPD proteins used in the 
assay. 
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ctSPD 
hsSPD (model) 
180° 


K1773 


model of hsSPD was generated with SWISS-MODEL. The coordinates of 


the model are available upon request. b, c, Zoomed-in views of the S1 and 
S4 pockets of the hsSPD model. 


Extended Data Figure 7 | Conservation of substrate-binding residues 
in human separase. a, Two different views of the cartoon of the structure 
of ctSPD-pAMK (green) and a homology model of human (hs) SPD 
(magenta). The phospho-AMK peptide is shown in sticks. The homology 
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Extended Data Figure 8 | Structural basis of phosphorylation- 
stimulated Sccl1 cleavage. a, Autoradiograph of the ctSPD cleavage 
reactions of *°S-ctSccl WT or I211A, treated with or without AsPIk1. 

b, Zoomed-in view of the surface drawing of ctSPD-pAMK. The surface 
is coloured according to the electrostatic potential, with red, blue, and 
white representing negative, positive, and neutral charges, respectively. 
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The covalently bound peptide is shown as sticks. c, Coomassie-stained gel 
of the indicated ctSPD proteins used in the assays described in Figs 3f and 
4c. d, Quantification of the fold of Plk1 stimulation in ctSccl cleavage by 
ctSPD WT and the indicated mutants as described in Fig. 3f. Error bars, 
s.d. (n =3 independent experiments). 
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Extended Data Figure 9 | Interactions between ctsecurin and ctseparase. 


a, Coomassie-stained gel of recombinant ctseparase-ctsecurin complexes 
and ctSPD expressed in insect cells. FL, full length. b, Autoradiograph of 
the ctSccl cleavage reactions by the ctseparase-ctsecurin complexes and 


ctSPD. c, Autoradiograph of the cleavage reactions of *°S-ctsecurin WT or 


mutants with or without ctSPD. d, Coomassie-stained gel of recombinant 


Strep-tagged ctseparase _)509 or the ctseparasej_1599—ctsecurin complex 
bound to Strep-Tactin beads. e, Autoradiograph of the ctScc1 cleavage 
reactions by ctSPD, in the absence or presence of varying concentrations 
of the ctsecurins3_177 or ctsecurin,s3_177 3A peptides. The EVE motif is 
mutated to AAA in the ctsecurin,s3_177 3A peptide. 
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Extended Data Table 1 | Data collection and refinement statistics 


Data collection 
Space group 
Cell dimensions 

a, b,c (A) 

a, By (°) 
Resolution (A) 
Rimerge (%) 

I/ol 
Completeness (%) 


Redundancy 


Refinement 

Resolution (A) 

No. reflections 

Rwork/ Riree 

No. atoms 
Protein 
Ligand/ion 
Water 

B-factors 
Protein 
Ligand/ion 
Water 

R.m.s deviations 
Bond lengths (A) 


Bond angles (°) 


ctSPD!6-2223 


P2,2;2, 


55.56, 98.89, 107.75 
90, 90, 90 

50.00-1.90 (1.93-1.90) * 
7.4 (97.0) 

27.7 (2.5) 

100 (100) 

7.2 (7.3) 


49.38-1.90 
46925 


17.3 / 20.0 


0.012 


1.24 


Data were collected from one crystal for each structure. 
*Highest-resolution shell is shown in parenthesis. 
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ctSPD'°???3_AMK 


P6322 


149.15, 149.15, 115.63 
90, 90, 120 

50.00-3.10 (3.15-3.10) 
15.5 (100) 

14.6 (1.3) 

99.9 (98.7) 

9.6 (8.7) 


48.82-3.10 
14228 


19.4/25.8 


3745 
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ctSPD'°? ??3_pAMK 


P2,2)2, 


56.35, 85.01, 119.27 
90, 90, 90 

50.00-1.85 (1.88-1.85) 
13.5 (100) 

19.8 (1.9) 

100 (100) 

7.5 (5.8) 


42.51-1.85 
50139 


18.0 / 20.7 


3730 
134 
454 


0.013 


1.05 


ILLUSTRATION BY THE PROJECT TWINS 


THE ROBOTS THAT TAP 
INTO BRAIN CELLS 


Neuroscientists hope to turn the delicate art of eavesdropping 
on neurons into an automated technique. 


BY HELEN SHEN 


lamping an electrode to the brain cell 
( ofa living animal to record its electrical 

chatter is a task that demands finesse and 
patience. Known as ‘whole-cell patch-clamping; 
it is reputedly the “finest art in neuroscience’, 
says neurobiologist Edward Boyden, and one 
that only a few dozen laboratories around the 
world specialize in. 

But researchers are trying to demystify this 
art by turning it into a streamlined, automated 
technique that any laboratory could attempt, 
using robotics and downloadable source code. 

“Patch-clamping provides a unique view 


into neural circuits, and it’s a very exciting 
technique but is really underused,” says neuro- 
scientist Karel Svoboda at the Howard Hughes 
Medical Institute’s Janelia Research Campus in 
Ashburn, Virginia. “That’s why automation is 
a really, really exciting direction” 

On 3 March, Boyden, at the Massachusetts 
Institute of Technology in Cambridge, and his 
colleagues published detailed instructions on 
how to assemble and operate an automated sys- 
tem for whole-cell patch-clamping', a concept 
that they first described in 2012 (ref. 2). The 
guide represents the latest fruits of Boyden’s 
partnership with the laboratory of Craig Forest, 
a mechanical engineer at the Georgia Institute 
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of Technology in Atlanta who specializes in 
robotic automation for research. 

Most neural recordings involve inserting an 
electrode in the space between cells to pick up 
electrical volleys between neurons. Such ‘extra- 
cellular recording’ detects outgoing signals but 
misses the electrical activity inside the cells 
that determines whether they will fire. This is 
where whole-cell patch-clamping comes in, a 
technique that can tap into a neuron’s innards. 
The delicate procedure “has a very steep learn- 
ing curve, and even then some people never 
really get it to work’, says Svoboda. 

Whole-cell patch-clamping involves push- 
ing a tiny glass pipette containing a wire 
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» electrode through the brain. In the most 
common, ‘blind’ version, researchers do this 
without being able to see the neurons. The 
scientist must continually apply pressure to 
push brain matter away from the pipette, but 
when a rise in electrode resistance indicates 
that a cell is nearby, they must switch to suc- 
tion at just the right moment to seal a tiny 
patch of the neuron’s membrane against the 
pipette’s super-thin tip. With an additional 
burst of suction, the researcher can then make 
a tiny hole in the cell membrane to record 
the neuron’s activity. Hitting the neuron at 
the wrong angle, misregulating the pressure 
and numerous other variables often derail 
recordings. 

“Every step has a certain failure rate, and 
these multiply throughout the process,” says 
Boyden. Experienced practitioners report 
success rates of between 20% and 60%. 

Boyden and Forest decided to automate this 
tricky technique. Their robot does not outper- 
form human experts yet, but its average success 
rate is around 33% in tests on mice. The device, 
which runs on the commercial programming 
platform LabVIEW, only requires researchers 
to position the animal and the pipette. A com- 
puter algorithm then controls the pipette'’s inter- 
nal pressure and its progression through the 
brain. A company called Neuromatic Devices 
in Atlanta, Georgia, offers machines based on 


Boyden and Forest’s technology, but did not 
disclose pricing or sales figures to Nature. 

At the University of Texas at Austin, research- 
ers have created a similar auto-patching system 
that is controlled in the MATLAB computing 
environment. This system uses a slightly differ- 
ent algorithm to decide when to start suction- 
ing, and it succeeds in patch-clamping cells in 
mice about 17% of the time’. Neuroscientist 
Niraj Desai, who led the team, says that he hopes 
to incorporate more-sophisticated algorithms. 

Some researchers question whether the 
recording robots will ever surpass the best 
human experts. “The elements that go into the 
human's decisions may be richer than can be 
captured by the machine,’ says neuroscientist 
Michael Hausser at University College Lon- 
don. But he adds that the technologies could 
still be a huge boon to novices. Others suggest 
that the robots could help users of all skill levels 
in lengthy or complex experiments, in which 
human fatigue becomes a limiting factor. 

At the Allen Institute for Brain Science in 
Seattle, Washington, researchers have devel- 
oped an automated system to assist in the even 
more challenging ‘image-guided’ variant of the 
technique. In this version, instead of blindly 
bumping into neurons with a pipette, scientists 
target specific neurons near the brain’s surface 
using a two-photon microscope. The proce- 
dure requires more coordination than blind 


patch-clamping because the scientist must 
constantly focus the microscope in addition to 
guiding the pipette and adjusting its internal 
pressure. “This is a technique that ideally would 
benefit from having three hands,” says Hausser, 
an expert in image-guided patching. 

The automated system constructs 3D images 
of the brain region of interest and allows users 
to digitally select the neuron that they want to 
record. Then, with the coordinates locked in, 
the device navigates the pipette into place. For 
now, researchers still need to patch onto the 
cell by hand, but Allen Institute neuroscientist 
and joint team leader Lu Li says that eventu- 
ally they hope to fully automate the procedure. 

Whether these automation systems will be 
taken up widely by the neuroscience commu- 
nity remains to be seen. Each of the teams has 
made their code freely available for people to 
download: Boyden’ group at autopatcher.org; 
Desai’s team at clm.utexas.edu/robotpatch; and 
Li's team at the GitHub repository (go.nature. 
com/sgjpab). “Our hope is that we can help as 
many people as possible to answer questions 
about how neurons compute,’ Boyden says. m 
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DATA SHARING 


Web widget nudges 
scientists to share their data 


Open Data Button launched to encourage public sharing of data sets. 


BY DALMEET SINGH CHAWLA 


free, web-based tool that promises to 
A= its users to ask authors of research 

papers to share their data publicly— 
and to make such requests publicly trackable 
— launched in beta version on 7 March. The 
Open Data Button (opendatabutton.org), a 
downloadable web-browser extension, can be 
clicked when a reader is looking at a research 
paper and wants to see its underlying data, 
says Joseph McArthur, who is co-leading the 
project and is assistant director of the policy 
advocacy group The Right to Research Coali- 
tion (R2RC) in London. 

When clicked, the button generates a tem- 
plate e-mail that the user can send to the paper's 
authors. It asks them to share the data support- 
ing the paper, explains how to do so and — if 
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the user has typed in the information — states 
why the data would be useful. All requests are 
simultaneously posted on the Open Data But- 
ton website, where anyone can comment on 
existing entries to note that they want access to 
the same data sets. 

If the author replies — either with a web 
link or attached data file — “we ask our users 
to affirm that this is the data they wanted’, 
says McArthur. The tool will chase authors 
once a week for four weeks after a request 
is filed, he adds, after which entries will be 
marked as ‘failed’ 

The project is mostly funded by a US$25,000 
grant from the non-profit Center for Open Sci- 
ence in Charlottesville, Virginia; the centre has 
promised to host any data files sent in reply on 
its own Open Science Framework repository. 
The aim is to encourage open data sharing, 
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which is still far from the norm in research, 
McArthur notes, even though many journals 
are now asking authors to publish their data 
alongside their research papers. 

“Tt is good to draw attention to an important 
problem such as data availability with a one- 
click gadget,” says Bernd Pulverer, chief editor 
of The EMBO Journal in Heidelberg, Germany. 
But a button by itself “is probably not quite 
going to cause the revolution’, he says. There 
is still a reluctance — especially among biolo- 
gists — to share data openly, because of fears of 
being scooped by competition, and because of 
the extra work required to make data sets open, 
he notes. Ultimately, McArthur hopes that the 
button will not be needed as it becomes the 
norm to share data openly. But progress could 
be slow — so he thinks that the tool can expect 
along and useful life. = 


MARK HUNT/HUNTSTOCK/CORBIS 


CAREERS 


JE Elite researchers identify 
scientific virtue p.139 


Anew PhD graduate counsels 
her younger self go.nature.com/xqq22d 


For the latest career 
listings and advice www.naturejobs.com 


Scientists who have disabilities use a growing array of specialized equipment to enable them to carry out research. 


DISABILITY AWARENESS 


The fight for accessibility 


An aisle too narrow, alab bench too high: the scientific world is a complex place for 
researchers with disabilities. But many of them find ingenious ways to make it work. 


BY ERYN BROWN 


enry “Hoby” Wedler caught the 
Hessen bug as a high-school stu- 
dent in Sonoma County, California. 
He loved to think about atoms and how they 
fit together to make molecules. Although he 
wanted to enrol in his school’s advanced chem- 
istry course, he faced a frustrating problem: he 
has been completely blind since birth, and his 
teacher thought that he might find it too hard. 
The field was too visual, she told him. 
Nonsense, Wedler thought. “I said, ‘No one 


can see atoms; it’s completely cerebral?” Now 
28, he is studying for a PhD in organic chem- 
istry at the University of California, Davis, 
and is considering teaching offers. His once- 
doubting school instructor has become one of 
his greatest allies. 

It’s not an easy road, but early-career scientists 
who face physical challenges such as blindness, 
deafness or paralysis are building varied and 
rewarding careers. They work in academic, 
government and industrial research as teach- 
ers, consultants and other occupations. 

Success requires desire, grit and ingenuity. 
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Researchers who have trouble seeing, hearing or 
with mobility need creative workarounds in the 
lab and field. They may have to design their own 
equipment. They need a crew of friends, peers 
and mentors who can provide support. And 
they must seek work that capitalizes on their 
strengths, accepting that some assignments may 
be beyond their reach — at least for now. 

“You have to really want to be a scientist a 
lot, or it won't happen,’ says Richard Mankin, 
a research entomologist for the US Depart- 
ment of Agriculture (USDA) in Gainesville, 
Florida, who wears braces on his legs and } 
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> relies on crutches to walk. 

Despite a patchwork quilt of policies and 
guidelines that are meant to broaden employ- 
ment opportunities, disabled people still have 
trouble finding work. A 2014 analysis by the 
Campaign for Science and Engineering, which 
promotes science in the United Kingdom, 
reported’ that fewer than half of the nation’s 
5.2 million people of working age with disabili- 
ties had jobs between 2010 and 2011. They were 
more than twice as likely as peers without dis- 
abilities to report working part-time, and about 
halfas likely to have jobs in science, technology, 
engineering and mathematics. 

Similarly, in 2015, the US National Science 
Foundation reported’ that about one in nine sci- 
entists aged 75 or younger in the United States 
had a disability. They, too, were more than twice 
as likely to be out of the labour force than their 
peers without disabilities. 

As Wedler found, accessibility issues crop up 
long before a scientist enters the workforce. Ger- 
man-born climatologist Imke Durre, blind since 
early childhood, says that her father, a computer 
scientist, created a Braille word-processing pro- 
gram in the early 1980s to help her to do school 
work. Such technology is widespread today, says 
Durre, who works for the US National Oceanic 
and Atmospheric Administration. 

But she says that there are still practical 
hurdles to using assistive tools, particularly for 
students. Even when Braille versions of text- 
books are available, teachers may not know 
about them or be able to order them in time 
for a class. Figures, tables and graphs typically 
arent translated into Braille, so a student with 
a visual impairment often needs to collaborate 
with a sighted colleague to interpret visual 
data — a process that may not go smoothly, 
says Durre. On occasion, she says with a laugh, 
she has shown up to a meeting carrying blank 
pages instead of the hard copies of graphs and 
tables that she thought she had prepared. 


AFLEXIBLE SPACE 

Mundane tasks pose barriers, too. James 
McNutt, who uses a wheelchair and studies the 
history of science at Queen's University in King- 
ston, Canada, set out last year to record some of 
the difficulties that he faces as he travels around 
the university (see go.nature.com/oaablb). With 
a video camera attached to his wheelchair, he 
attempted to open doors, operate lifts and visit 
washrooms. Buttons, switches and knobs were 
often out of reach. Doors wouldn't open. Pas- 
sageways were too narrow or too winding to 
accommodate a wheelchair easily. 

“Quite often, the planning people don’t have 
an idea of what is and is not accessible,” says 
McNutt, who has cerebral palsy. “They don't 
know how big the wheelchair is. They don’t have 
any idea what it’s like” 

Audrey Kobayashi, a geographer at Queen's 
University and a member of the school’s com- 
mittee on campus accessibility, says that the 
situation has improved greatly over the past two 
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GROUP ACCOMMODATION 


A worldwide wave of awareness 


@ The STEMM Disability Advisory 
Committee, a partnership between the 
Royal Society and other scientific groups 

in the United Kingdom, pushes to improve 
funding for scientists with disabilities, and 
hosted a conference in March focusing on 
the transition from study to the workplace. 
@ The American Association for the 
Advancement of Science led early on in 
making conferences accessible, and sponsors 
Entry Point!, a programme that places young 
scientists with disabilities at internships with 
university, government and industry labs. 

@ Sang-Mook Lee’s Quality of Life Technology 
initiative at Seoul National University, funded 


decades. There are now clear lines of respon- 
sibility to ensure that students with disabilities 
get any help they need, and the movement is 
garnering attention (see ‘A worldwide wave of 
awareness’). But Kobayashi, who uses a wheel- 
chair at work because ofa neurological disorder 
called transverse myelitis, says that a lot of 
challenges remain. “We're trying to make a bar- 
rier-free campus, but it’s slow,’ she says. 

The difficulties don’t end after receiving a 
degree. Researchers who have a disability and 
want to work in a scientific field must first 
ensure that physical adjustments are made to 
labs and other workplaces to facilitate access. 
These can include redesigning lab sinks to 
accommodate a wheelchair, posting emer- 
gency instructions in Braille and checking that 
doorways to halls with lifts or ramps don’t lock 
automatically and block exits. 

In the United States, the Americans with 
Disabilities Act, enacted in 1990, requires 
accommodations in public spaces. The UK 
Disability Discrimination Act (1995) and 
Canada’s Accessi- 
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often do not require 
retrofitting, and thus 
many labs that were 
built before the legislation took effect remain 
difficult or impossible to navigate for scientists 
who have mobility issues: aisles are too nar- 
row, tables are too tall and eyewash stations are 
tucked into inconvenient corners. 

Some scientists are attacking the problems on 
their own. Neuroscientist Bradley Duerstock, 
who has been quadriplegic for more than 
25 years, developed a disability-friendly lab 
space at Purdue University’s Center for Paralysis 
Research in West Lafayette, Indiana, where he 
researches assistive technologies. With grants 


accessible.” 
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by the South Korean government, promotes 
adaptive technologies for disabled students. 
@ Analytical chemist Karl Booksh at the 
University of Delaware in Newark runs a 
programme for disabled undergraduates that 
prepares them for graduate work. 

@ The American Chemical Society’s Chemists 
With Disabilities committee has published a 
guide to teaching disabled chemistry high- 
school, college and graduate students’. 

@ The International Association for 
Geoscience Diversity, in Cincinnati, Ohio, 
seeks to improve accessibility for students 
and scientists with disabilities in the Earth 
sciences. EB. 


from the US National Institutes of Health and 
other sources, he adapted the ‘kitchen work 
triangle’ — a home-design layout concept that 
imagines the stove, sink and refrigerator as the 
shape’s corners — into a ‘wet-laboratory work 
triangle’ defined by the lab sink, lab bench and 
fume hood. Years ago, he designed a microscope 
that lets the user control illumination, focus and 
exposure through a computer interface, rather 
than tricky-to-operate knobs (now, such micro- 
scopes are commercially available). 

Scientists with disabilities are also installing 
light switches that are positioned at an accessible 
height and that are labelled for people who rely 
on Braille, and they use adjustable-height lab 
benches and other accommodations. 

Back in California, Wedler gives credit to 
his adviser, chemist Dean Tantillo, for hiring 
him as an undergraduate and for making the 
lab an easier place to work. With the help of 
US$30,000 in supplemental funding attached 
to his US National Science Foundation gradu- 
ate fellowship, Wedler and other members of 
Tantillos team developed a 3D-printing solu- 
tion that produces tactile models of molecular 
structures. Different shapes and textures in the 
models represent different atoms and bonds, 
and Braille notations describe bond angles and 
bond lengths. It takes a couple of days to gen- 
erate each printed model, but Wedler doesn't 
mind: the homegrown technology allows him 
to ‘feel’ the outcomes of his calculations so that 
he can verify his work on his own. 

When Duerstock and Susan Mendrysa, then 
a colleague at Purdue's Institute for Accessible 
Science, polled coworkers to learn what quali- 
ties make a lab most accommodating, they 
found that working with an established, well- 
funded principal investigator was key: such 
scientists were more likely to give staff with 
disabilities the extra time they might need to 
complete their studies or to publish papers. 

Colleagues can also help on a smaller scale. 
Born with muscles missing in his legs and 
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Entomologist Richard Mankin (right) does work 
for the US Department of Agriculture in Guam. 


arms, Mankin has conducted field research 
internationally since the 1970s, walking with 
crutches or crawling along the ground to 
study the sounds and vibrations that insects 
make in various locations. He seldom works 
in the field alone, and he keeps his trips 
short. He asks those who accompany him to 
manage tasks that he cannot perform, such 
as carrying equipment and climbing trees. 

Physical barriers are not the only obsta- 
cles: bias can also be an issue. Jae-Hyeon 
Parq, a postdoctoral researcher at Seoul 
National University, who has used a wheel- 
chair since sustaining a spinal injury as an 
undergraduate, worries that his disability will 
make it hard for him to find a job. Trained 
as a physicist, Parq now works in the lab of 
marine geologist Sang-Mook Lee, who has 
been trying to improve conditions for scien- 
tists with disabilities since 2006, when he was 
paralysed in a car accident. 

“Most people, especially in Korea, don’t 
understand the diversity of disabled people,” 
Parq says. “They judge what I can and what 
I can't do from my appearance.” If Parq can't 
get a permanent job, he says, he will continue 
to work for Lee. 


PUBLIC SECRETS 
Those whose disabilities aren't as imme- 
diately obvious face a different, yet related 
problem: whether to tell potential employers. 
“One of the most common questions I get is, 
should I say on my CV that I’m deaf?” says 
biochemist Annemarie Ross of the Rochester 
Institute of Technology's National Technical 
Institute for the Deafin New York. Ross, who 
is hearing-impaired, tells students that it is 
their choice — there is no clear advantage for 
applicants who do or don't reveal a disability. 
But it’s a challenge that must be resolved, she 
says. “A big barrier in general for our students 


are the employers. They think, ‘Ifa worker 
cant hear a fire alarm, how do we make sure 
they're safe? If they stay behind in a burning 
lab, we could be liable” Often, job candidates 
must persuade employers to reframe their 
assumptions in interviews, Ross says. Those 
with hearing disorders, for example, can see 
the strobe lights on many modern fire-alarm 
systems. By the same token, scientists in a lab 
don't spend much time doing physical tasks. 

“I was always having to persuade people I 
could do things from a wheelchair,’ says Karl 
Booksh, an analytical chemist at the Univer- 
sity of Delaware in Newark who experienced 
a spinal cord injury in university. “The way I 
convinced most of them was pointing out that 
the most successful faculty members didn't 
know where the pipettes were to begin with 
— that the key to success was writing papers 
and proposals.” 

Some scientists with disabilities have 
reframed their impairment as a positive 
attribute: they say that coping with the chal- 
lenges of everyday life has helped them to 
develop unusual skills and expertise. Wedler, 
for instance, says that navigating town trained 
his brain to make spot-on mental maps. A 
similar sort of spatial thinking helps him with 
organic chemistry. “I was thinking in terms of 
feet and miles, but there’s no reason you can't 
shrink that down to angstroms,” he says. “In 
terms of doing the problems, I might have an 
advantage over my sighted peers.” 

Mankin is dubious that the stigma against 
those with disabilities will ever fade com- 
pletely. He is president of the Foundation 
for Science and Disability, which sponsors a 
grant programme that supports the research 
of graduate students with disabilities. But, he 
says, he doesn’t think of himself as disabled. 

He is an enthusiast whose voice crackles 
with excitement when he talks about his work. 
He is studying psyllids, insects that cause a 
tree-damaging disease that threatens Florida's 
$10-billion citrus industry, and he has been 
developing systems that use vibrations to lure 
and trap male psyllids to prevent them from 
mating with females nearby. The approach 
could offer an alternative to pesticides, and has 
attracted the attention of federal legislators. 

“Being a scientist has been lots of fun,” 
Mankin says. “I’ve done things that I hope 
have benefitted humanity. This is what I 
always wanted to do.” = 


Eryn Brown is a freelance writer in Los 
Angeles, California. 
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CHARACTER TRAITS 
Scientific virtue 


Honesty and curiosity are the most 
important traits underlying excellent 
science, according to a survey of around 
400 members of elite US scientific 
societies, such as the National Academy 
of Sciences. A pilot study led by survey 
co-organizer Robert Pennock, a 
philosopher at Michigan State University 
in East Lansing, had previously identified 
the ten most widely held values among 
scientists who have been honoured by 
their peers for being exemplary. Although 
honesty and curiosity dominated, these 
virtues also included perseverance, 
objectivity and the willingness to abandon 
a preferred hypothesis in the face of 
conflicting evidence (see ‘Core values’). 

Little empirical research has been done 
to learn what traits scientists value most 
in one another, says Pennock, and this 
work indicates a high level of consensus 
among elite US researchers about what 
is important for the practice of science. 
He thinks that training programmes that 
emphasize such shared scientific values 
are likely to be more effective than are 
those that focus on compliance with 
official rules of behaviour; 94% of the 
scientists surveyed felt that scientific 
virtues can be learned. 

About four in five of those surveyed feel 
that today’s trainees share the scientific 
values that they themselves held when 
training, and 88% take candidates’ 
scientific character traits into account 
when recruiting lab members. The team 
members presented their preliminary 
results at a meeting of the American 
Association for the Advancement of 
Science in February (see go.nature.com/ 
o4urjl), and they plan to publish full 
results from a sampling of 500 established 
scientists, in addition to a similar-sized 
group of early-career scientists, in 
upcoming months. 


CORE VALUES 


Elite scientists were asked which three values 
they consider to be the most important. 

Honesty 

Curiosity 

Perseverance 

Objectivity 

Humility to evidence* 

Attentiveness 

Scepticism 

Courage 

Collaborative 

Meticulousness 


*Willingness to abandon a preferred hypothesis when faced 
with conflicting results. 
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Ua SCIENCE FICTION 


CHOICES, IN SEQUENTIAL ORDER 


BY KARLO YEAGER RODRIGUEZ 


S1INGLE-AX1S DESIGNS: 

THE KEY TO LIFE! (COPYRIGHT 2349). 
START 

PLANET: EARTH / TAU CETI E 

Tau Ceti e. 

REDIRECTING TO XENOBIOTA... 

You're killing me, down in this tunnel 
you've made, but I feel like I can only talk to 
you. Strange, huh? I feel euphoric. I wonder 
if it’s the low oxygen levels on the planet's 
surface. I feel it’s because of you. I'm 12 light 
years away from home, paralysed in a hole 
in the ground, on a hell-planet, but you're 
everything I ever dreamed in my life. 

CAN IT MOVE? PRESENT / NOT PRESENT 
Present. 

SYMMETRY: RADIAL OR PENTAMERISM / 
BILATERAL 

Bilateral. 

Because of you, the promise of you, Vee 
and Mom agreed. The one time they agreed 
on something, y know? They joined forces, 
but I was ready. I told them Pop would’ve 
jumped at the chance I was given. 

Xenobiologist! An advance team. Pop 
would have been so proud! The only space 
exploration he knew was contained in the 
paperbacks he read until he couldn't read 
any longer. When that happened, I read to 
him. The breathing machines shushed me, 
the rattle and hiss of air pumped in and out 
of Pop's lungs. His eyes closed, I’m sure he 
dreamed of the vast frontier. And one day — 
one day, the machines stuttered into stillness. 
I knew the time had come. With Pop gone, 
why did I have to stay Earthbound? 

After I beam the dichotomous key data 
back, Pll suggest naming you after my Pop. 
Xenopendra pérezii has a good ring. 

When I mentioned his wishes, Vee and 
Mom said it wasnt fair. What does fair have 
to do with anything? 

PANARTHROPODA 

JOINTED LEGS? YES / NO 

Yes. 

HOW MANY ANTENNAE? TWO PAIRS / 
ONE PAIR OR NONE 

It only has one pair. 

It’s getting dark in here. How's the saying go? 
Better to light a viewscreen than to curse the 
darkness, huh? Good thing my suit’s helmet 
has a heads-up display. Best thing, the display 
— Ihave something to do while you're gone. 
Auxiliary team needs to know about you. 

Thank you for pinning me down. Prob- 
ably break my neck. I’m dead either way. 
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Into the unknown. 


My cheek itches. 

Where are —? Oh — youre fast! You must 
not need visible light to see, huh? 

Oh, I — I guess maybe I moved, but that 
glued me to the spot. Won't move anymore. 

I can feel them wriggle around, now. I 
wonder if they'll tickle when they hatch? 
NUMBER OF LEGS? THREE PAIRS / 
FOUR OR MORE PAIRS 
Four or more pairs. 

NUMBER OF LEGS PER SEGMENT? TWO 
PAIRS / ONE PAIR 
One pair. 

Cantt feel my fingers. There’s a hot breeze 
across my belly, though. The crosswind tells 
me you must have dug out another exit. The 
airflow leaves my mouth full of sand. I can 
feel it across my belly, too — suit’s torn open. 

The diagnostic module doesn't respond. 
You must have damaged it when you ripped 
it open. It could’ve analysed for poison. 
Vitals circuit's running. Redundancies: good. 
The beacon’ activated if vitals drop to null. 

Fangs — forcipules, yes, not fangs — must 
deliver a weak toxin. Is the euphoria caused 
by it alone, or is it a factor? What about low 
oxygen? Temperature? 

I must've pissed myself — I can smell it. 

Pressure like a clenching fist on my abdo- 
men. Now, it unclenches and flutters open, 
moves, clenches again. 

Why am I so — accepting? I know — my 
brain knows — your young will devour me. 
Why can't I feel panic? Fear? I should stop — 
why can't I stop talking to you? 

NUMBER OF TRUNK SEGMENTS? 12 / 
MORE THAN 12 

More than 12. Much more. 

FORCIPULES? YES / NO 

Yes. 

Vee never listened — ‘no’ meant getting 
what she wanted suffered a setback. She 
pushed and pushed and pushed — always 
what she wanted. Exhausting. She wanted to 
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have kids — she tells me this the night before. 
Incredible, right? I'm ready to go to the facility, 
to get scrubbed, suited up, loaded into a high- 
tech version ofa tin can and shot across the 
vast reaches of space. She wants to talk babies. 

Strange — before, Vee had shouted down 
to me from her place on the corporate lad- 
der: a few more years! ignored her. 

Where are you? I hear a scraping sound at 
the far end of the tunnel — are you digging? 
Are you going to leave me here? I suppose I 
should expect it. I would have thought you 
might watch over your babies. 

No. Larvae — the right word is larvae. 
Certain species coil around their egg clus- 
ter. Not you. 

So — babies. Oh, you should have seen 
Vee break stuff! She even threatened me 
at knifepoint, blocked my way out of the 
house. The security team made sure I got to 
the facility in time, though. They hustled me 
out of the house, through the prep stations. 
It's funny. Now I remember: they wanted to 
make sure I didn't bring any bugs with me. 
Hilarious, right? 

CHILOPODA? YES / NO 
No. 

RECALCULATING... 
RECALCULATING... 
RECALCULATING... 

SIZE: A METRE OR LESS / MORE THAN 
A METRE 
More than a metre. 

CURVED TAIL, WITH OVIPOSITOR? 

YES / NO 
That an ovipositor in your pocket? Or are you 
happy to see me? 

‘THETA OVIPOSITOR AIR POCKET 
ARE YOU HAPPY TWO SEAMY’ IS NOT 
ACCEPTED. PLEASE ANSWER IN ‘YES’ 
OR ‘NO’ FORMAT 
Yes. 

Yes. 

It’s happening. 

They do tickle and my breath is ragged as 
I suppress laughter. The viewscreen flashes 
once, flickers and goes out, switches to aux- 
iliary power. I am part of something bigger, 
something more, now. 

ALERT! DANGEROUS XENOFORM 
ACTIVATING RESCUE BEACON 
SOS SOS SOS m 


Karlo Yeager Rodriguez is from Puerto 
Rico, but moved to Baltimore, Maryland, a 
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